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IMMUNIZATION FOR EBOLA VIRUS INFECTION 

"FIELD OF THE INVENTION 

The present invention relates generally to viral vaccines and, more particularly, 
to Ebola virus vaccines and methods of protecting against disease caused by infection 
with Ebola virus. 

BACKGROUND OF THE INVENTION 

The Ebola viruses, and the genetically-related Marburg virus, are filoviruses 
associated with outbreaks of highly lethal hemorrhagic fever in humans and primates 
in North America, Europe, and Africa. Peters, C.J. et al., Filoviridae: Marourg and 
Ebola Viruses, in Fields Virology, (eds., Fields, B.N.. Knipe. D.M.& Howley. P.M.) 
1161-1176 (Philadelphia, Lippincott-Raven. 1996); Peters, C.J. etal , Semin. Viml. 
5:147-154 (1994). Ebola viruses are negative-stranded RNA viruses comprised of 
four subtypes, including those described in the Zaire, Sudan, Reston, and Ivory Coast 
episodes. Sanchez, A. et al., PNAS (USA) 93:3602-3607 (1996). Although several 
1 5 subtypes have been defined, the genetic organization of these viruses is similar, each 
containing seven linearly arrayed genes. Among the viral proteins, the envelope 
glycoprotein exists in two alternative forms, a 50-70 kilodalton <kDa) secreted protein 
of unknown function encoded by the viral genome and a 130 kDa transmembrane 
glycoprotein generated by RNA editing that mediates viral entry. Peters. C.J.^et al., 
20 Filoviridae: Marburg and Ebola Viruses, in Fields Virology. <eds.. Fields, B.N., Knipe* 
D.M.& Howley. P.M.) 1161-1176 (Philadelphia. Lippincott-Raven. 1996); Sanchez, A. 
etal.. PNAS (USA) 93:3602-3607 (1996). Other structural geneproducts include the 
nucleoprotein (NP). matrix proteins VP24 and VP40. presumed nonstructural proteins 
VP30 and VP35, and the viral polymerase (reviewed in Peters. C.J.-et ai, Filoviridae: 
25 Marburg and Ebola Viruses, in Fields Virology. <eds.. Fields. B.N.. Knipe. D.M.& 
Howley. P.M.) 1161-1176 (Philadelphia. Lippincott-Raven. 1996)). Although 
spontaneous variation of its RNA sequence does occur in nature, there appearstobe 
less nucleotide polymorphism within €bola subtypes than among other RNA viruses 
(Sanchez. A. et al.. PNAS (USA) 93:3602-3607 (1 996)). suggesting that immunization 
30 may be useful in protecting against this disease. Previous attempts to elicit protective 
immune responses against E-bola virus using traditional active and passive 
immunization approaches have, however, not succeeded. Paters. C.J. -et al.. 
Filoviridae: Marburg andEbola Viruses, in Fields Virology. <eds.. Fields. B.M.. Knipe. 
D.M.& Howley. P.M.) 1 161-1 176 Philadelphia. LippincotMRaven, 1996); Clegg. J.C.S. 
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et al., New Generation Vaccines. (eds. T Levine, M.M., Woodrow, G.C., Kaper, J.B.& 
Cobon, G.S.) 749-765 (New York, NY, Marcel Dekker, Inc. 1997); Jahrling, P.B. et al., 
Arch. Virol. Suppl. 11:135-140 (1996). 

It would thus be desirable to provide a vaccine to protect against disease 
5 caused by infection with Ebola virus. It would further be desirable to provide methods 
of making and using said vaccine. 

SUMMARY OF THE INVENTION 
Ebola virus vaccines comprising nucleic acid molecules encoding Ebola viral 
proteins are provided. In one embodiment, the nucleic acid molecule encodes the 
10 transmembrane form of the viral glycoprotein (GP). In another embodiment, the 
nucleic acid molecule encodes the secreted form of the viral glycoprotein (sGP). In 
yet another embodiment, the nucleic acid molecule encodes the viral nucleoprotein 
(NP). 

The present invention also provides methods for immunizing a subject against 

15 disease caused by infection with Ebola virus comprising administering to the subject 
an immunoeffective amount of an Ebola virus vaccine. Administration can be by any 
of the routes normally used for gene therapy. In a preferred method, the Ebola virus 
vaccine is administered by intramuscular injection. The genetic immunization 
methods of the present invention provide protective immunity against disease caused 

20 by infection with Ebola virus. 

Additional objects, advantages, and features of the present invention will 
become apparent from the following description and appended claims, taken in 
conjunction with the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 The various advantages of the present invention will become apparent to one 

skilled in the art by reading the following specification and subjoined claims and by 
referencing the following drawings. 

Figures 1A and 1B are photographs showing expression of Ebola virus gene 
products in eukaryotic plasmid expression vectors. 

30 Figure 1A. Expression vectors encoding the indicated viral gene products 

under regulation of the CMV immediate-early region 1 enhancer and promoter were 
prepared and transfected into 293 cells as previously described. Manthorpe, M. et 
al. Hum. Gene. Ther. 4:419-431 (1993); Sambrook, J. t Fritch, E.F., & Maniatis, T. 
Cold Spring Harbor, N.Y. Cold Spring Laboratory Harbor Press, 1994. Cell extracts 

35 were prepared and analyzed by Western blot analysis for NP (left) or GP (right) using 
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relevant rabbit antisera and a secondary antibody, horseradish peroxidase conjugated 
donkey anti-rabbit IgG of a dilution of 1 :5,000. Incubation with primary antibody was 
for 30 minutes at room temperature, and for 30 minutes at room temperature with 
secondary antibody. Immunocomplexes were then detected by chemiluminescence 
using super signal substrate reagents (Pierce) according to manufacturer's 
instructions. 

Figure 1B. Generation of antibody response in mice immunized with the 
indicated vectors and analyzed by Western blot for NP, GP, and sGP as shown. 
Antisera from mice were tested at a dilution of 1:500 <NP), 1:50 (GP), or 1:50 <sGP), 
respectively, and developed with a secondary antibody <sheep anti-mouse, 1:5,000, 
Amersham Life Science) and chemiluminescence as in Figure 1A. The control vector 
used for immunization represents the expression vector plasmid with no insert. 
Manthorpe, M. et al., Hum. Gene. Ther. 4:413-431 (1993). 

Figures 2A-2D are graphs showing the immune responses to NP andGP after 
15 genetic immunization in mice. 

Figure 2A. Splenic lymphocytes from vector or NP-plasmid immunized mice 
were isolated approximately 6 weeks after the initial immunization and sensitized in 
vitro for 5 days with 10 U/ml hlL-2. Renca-NP ceHs sensitized splenocytes from 
vector-immunized or pCMV-NP immunized mice were used to -detect -CTL activity at 
the indicated effectortarget ratios on Renca or Renca-NP cells (left, middle) or with 
allogeneic effector cells with Renca-NP to show that they are susceptible to lysis 
(right). Allogeneic effector cells were generated oy incubating cells derived from mice 
with a C57BI/6 background (5x10 6 /ml) with irradiated Bato/c spleen cells (5x10*/ml) 
in the presence of IL-2 (20 U/ml) for five days. Thechromium release CTL assay with 
Renca-NP cells was performed in triplicate as previously described. Ohno, T. et al.. 
Gene. Ther. 4:361-366 (1997). 

Figure 2B. Balb/C female mice were immunized with the sGP plasmid 
expression vector and analyzed for their ability to lyse the syngeneic Renca cell line 
stably expressing GP. Isolation of stable transfectants. confirmation of expression, 
and CTL assay were performed as described (see. Specific Example, II. Methods). 
Renca-GP or sGP sensitized splenocytes from pCMV-GP orpCMV-sGP immunized 
mice were used to determine the specific killing of s, chromium labeled -Renca-GP cells 
at the indicated E/T ratios. 
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Figure 2C. Mice immunized with GP were analyzed for their ability to lyse a 
syngeneic CT26 cell stably expressing GP or CT26 vector control transduced line at 
the indicated E/T ratios. 

Figure 2D. Cellular proliferative response in the indicated immunized mice. 
5 T cells, enriched or depleted (see. Specific Example, II. Methods), were incubated at 
10 5 cells/ml with sGP condition media (25%). Background was determined with cells 
incubated in media from control transfected 293 cells and subtracted from proliferation 
seen in sGP-containing supernatants. 

Figures 3A-3C are graphs showing immunization with sGP or GP expression 
10 plasmids induces T cell responses to sGP in guinea pigs. 

Figures 3A-3C. Cell-mediated immunity in guinea pigs was analyzed by 
performing assays to detect cell proliferation to control or GP antigen (A) or T-cell 
growth factor production in response to the indicated antigens. The culture 
supernatants containing these antigens were prepared as previously described 
15 (Bottomly, K. et al„ Measurement of human and murine interleukin 2 and interleukin 
4. in Current Protocols in Immunology, (eds., Coligan, J.E., Kruisbeek, A.M., 
Margulies, D.H., Shevach, E.M.& Strober, W.) 6.3.1-6.3.12 (New York, John Wiley & 
Sons, Inc. 1992); Arai, H. et al M Nat Med. 3:843-848 (1997)), and included at a final 
concentration of 10% (volume/volume). In A, cell numbers refer to the concentration 
20 of spleen cells per ml in the 3 H-thymidine proliferation assay. In B, supernatants from 
A, harvested at the time of the peak proliferative response to sGP, were incubated 
with primary guinea pig T cells maintained in 200 U/ml of human IL-2. The percent 
maximal response refers to the magnitude of stimulation in response to the indicated 
stimuli relative to supernatants from 24 hour concanaval (in A-stimulated cells (2 
25 //g/ml)). The requirement of T lymphocytes in guinea pig spleen cells for the 
proliferative response to sGP, performed as described in Specific Example, II. 
Methods, is shown (C). 

Figures 4A-4F are photographs showing the immunohistochemical analysis of 
Ebola virus antigens in liver, lung, and spleen from representative protected (GP- 
30 animal 3) or infected (vector-animal 2) guinea pigs. 

Figures 4A-4F. Magnification: liver, 40x; lung, 20x; spleen, 20x. 
Figure 5 is a schematic of the plasmid pVR 1012-GP<IC) (Ivory Coast strain 
of GP.SEQ ID NO: 1). 

Figure 6 is a schematic of the plasmid pVR 1012-GP(S) <Sudan strain of GP, 
35 see SEQ ID NO: 2). 
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Figure 7 is a schematic of the plasmid pVR 1012-<3P<Z) {Zaire strain of GP, 
see SEQ ID NO: 3). 

Figure 8 is a schematic of the plasmid pVR 1 01 2-sGP<Z) {Zaire strain of sGP, 
see SEQ ID NO: 4). 

5 Figure 9 is a schematic of the plasmid pVR 1012-NP. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Ebola virus vaccines are provided comprising a nucleic acid molecule encoding 
an Ebola viral protein operatively-linked to a control sequence in a pharmaceutical^ 
acceptable carrier. Jn one embodiment, the nucleic acid molecule encodes the 
10 transmembrane form of the viral glycoprotein (<3P). In another embodiment, the 
nucleic acid molecule encodes the secreted form of the viral glycoprotein <sGP). In 
yet another embodiment, the nucleic acid molecule encodes the viral nucleoprotein 
(NP). 

The present invention further includes vaccines comprising nucleic acid 
molecules encoding Ebola viral proteins other than GP, sGP. and NP, e.g., other 
structural gene products which elicit protective immunity from disease caused by 
infection with Ebola virus. The nucleic acid molecules of the vaccines of the present 
invention encode structural gene products of any Ebola viral strain including the Zaire, 
Sudan, Ivory Coast and Reston strains. Nucleic acid molecules encoding structural 
gene products of the genetically-related Marburg virus strains may also be employed. 
Moreover, the nucleic acid molecules of the present invention may -be modified, e.g., 
the nucleic acid molecules set forth herein may be mutated, as long as -the modified 
expressed protein elicits protective immunity from disease caused by infection with 
Ebola virus. For example, the nucleic acid molecule may be mutated so that the 
25 expressed protein is less toxic to cells. The present invention also includes vaccines 
comprising a combination of nucleic acid molecules. "For example, and without 
limitation, nucleic acid molecules encoding <3P, sGP and NP of the Zaire. Sudan and 
Ivory Coast Ebola strains may be combined in any combination, in one vaccine 
composition. 

The present invention alsoprovides methods for immunizing a subject against 
disease caused by infection with Ebola virus comprising administering to the subject 
an immunoeffective amount of an Ebola virus vaccine. Methods of making and using 
Ebola virus vaccines are also provided by the present invention including the 
preparation of pharmaceutical -compositions. 
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As referred to herein, the term "encoding" is intended to mean that the subject 
nucleic acid may be transcribed in a cell, e.g., when the subject nucleic acid is linked 
to appropriate control sequences such as a promoter in a suitable vector (e.g., an 
expression vector) and the vector is introduced into a cell. The nucleic acid 
5 molecules of the present invention may be DNA molecules, cDNA molecules or RNA 
molecules, and are preferably cDNA molecules. The term "operatively-linked" as 
used herein refers to functional linkage between a nucleic acid expression control 
sequence (such as a promoter) and a second nucleic acid sequence, wherein the 
expression control sequence directs transcription of the nucleic acid corresponding 
10 to the second sequence. Expression control sequences are known to those skilled 
in the art (see, e.g., Goeddel, Gene Expression Technology: Methods in Enzymology 
185, Academic Press. San Diego, CA (1990)). Vectors which contain both a promoter 
and a cloning site to which an inserted piece of nucleic acid is operatively-linked to 
the promoter, are well known in the art and are generally referred to herein as 
1 5 "expression vectors" or "expression vector plasmids". Preferably, these vectors are 
capable of transcribing nucleic acid in vitro and in vivo. A preferred vector is the 
cytomegalovirus (CMV) expression vectorwhich directs high levels of gene expression 
in muscle. 

Nucleic acid molecules which hybridize under stringent conditions to the 
20 nucleic acid molecules described herein are also within the scope of the present 
invention. As will be appreciated by those skilled in the art, multiple factors are 
considered in determining the stringency of hybridization including species of nucleic 
acid, length of nucleic acid probe, T m (melting temperature), temperature of 
hybridization and washes, salt concentration in the hybridization and wash buffers. 
25 aqueous or formamide hybridization buffer, and length of time for hybridization and 
for washes. An example of stringent conditions are DNA-DNA hybridization with a 
probe greater than 200 nucleotides in 5 x SSC, at 65'C in aqueous solution or 42°C 
in formamide, followed by washing with 0.1 x SSC, at 65°C in aqueous solution. 
(Other experimental conditions for controlling stringency are described in Maniatis. T. 
30 et al., Molecular Cloning: a Laboratory Manual, Cold Springs Harbor Laboratory, Cold 
Springs, N.Y. (1982) at pages 387-389 and Sambrook. J. et al., Molecular Cloning: 
a Laboratory Manual, Second Edition, Volume 2, Cold Springs Harbor Laboratory, 
Cold Springs, N.Y., at pages 8.46-8.47 (1989)). 

It will be appreciated that administration of the vaccines of the present 
35 invention can be by any of the routes normally used for gene therapy. In a preferred 
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method, administration is by intramuscular injection, however, other procedures for 
transfecting cells may also be employed, such as transfection using calcium 
phosphate coprecipitation, liposome-mediated transfection, plasmid and viral vector- 
mediated transfection and DNA protein complex-mediated transfection. Viral vector- 
5 mediated transfection includes, without limitation, the use of retroviral, replication- 
deficient retroviral, adenoviral and adeno-associated viral vectors. Cells transfected 
by the vaccines in the context of ex vivo gene therapy can also be administered. 

It will be appreciated that more than one route of administering the vaccines 
of the present invention may be employed either simultaneously or sequentially <e.g., 
10 boosting). In addition, the vaccines of the present invention may be employed in 
combination with traditional immunization approaches such as employing protein 
antigens, vaccinia virus and inactivated virus, as vaccines. Thus, in one embodiment, 
the vaccines of the present invention are administered to a subject <the subject is 
"primed" with a vaccine of the present invention) and then a traditional vaccine is 
15 administered (the subject is "boosted" with a traditional vaccine). In another 
embodiment, a traditional vaccine is first administered to the subject followed by 
administration of a vaccine of the present invention. In yet another embodiment, a 
traditional vaccine and a vaccine of the present invention are co-administered. 

Immunogenicity may be significantly improved if the vaccines of the present 
20 invention are co-administered with an immunostimulatory agent or adjuvant. 
Adjuvants enhance immunogenicity but are not necessarily immunogenic themselves. 
Immunostimulatory agents or adjuvants have -been used for many years to improve 
the host immune responses to, for example, vaccines. Adjuvants may thus be 
employed to enhance the immunogenicity of the vaccines of the present invention, as 
25 well as the immunogenicity of traditional vaccines. Suitable adjuvants are well known 
to those skilled in the art and include, without limitation, aluminum phosphate, 
aluminum hydroxide. QS21. Quil A. derivatives and components thereof, calcium 
phosphate, calcium hydroxide, zinc hydroxide, aglyeolipid analog, an cctodecyl -ester 
of an amino acid, a muramyl dipeptide, polyphosphaz-ene. a lipoprotein. ISCOM 
30 matrix. DC-Choi, DDA. and other adjuvants and oaoterial toxins, components and 
derivatives thereof. 

The vaccines of the present invention may also oe co-administered with 
cytokines to further enhance immunogenicity. The cytokines may be administered by 
methods known to those skilled in the art. e.g., as a nucleic acid motecule in plasmid 
35 form or as a protein or fusion protein. 
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Upon inoculation with a pharmaceutical composition as described herein, the 
immune system of the host responds to the vaccine by producing antibodies, both 
secretory and serum, specific for Ebola virus proteins. As a result of the vaccination, 
the host becomes at least partially or completely immune to Ebola virus infection, or 
5 resistant to developing moderate or severe disease caused by Ebola virus infection. 
Although Ebola virus infection and disease caused thereby are discussed in detail 
herein, it will be appreciated that the vaccines and methods of the present invention 
may be employed to immunize a subject against hemorrhagic fever generally, such 
as that caused by infection by the genetically-related Marburg virus. 
1 0 Pharmaceutical compositions comprising the nucleic acid molecules encoding 

Ebola viral proteins described herein, either alone or in combination, and a 
pharmaceutical^ acceptable carrier, are also provided by the present invention. As 
used herein, the phrase "pharmaceutically acceptable carrier" encompasses any of 
the standard pharmaceutical carriers, such as those suitable for parenteral 
15 administration, such as, for example, by intramuscular, intraarticular (in the joints), 
intravenous, intradermal, intraperitoneal, and subcutaneous routes. Examples of such 
formulations include aqueous and non-aqueous, isotonic sterile injection solutions, 
which contain antioxidants, buffers, bacteriostats, and solutes that render the 
formulation isotonic with the blood of the intended recipient, and aqueous and non- 
20 aqueous sterile suspensions that can include suspending agents, solubilizers, 
thickening agents, stabilizers, and preservatives. 

Formulations suitable for oral administration can consist of (a) liquid solutions, 
such as an effective amount of the vaccine dissolved in diluents, such as water, saline 
or PEG 400; (b) capsules, sachets or tablets, each containing a predetermined 
25 amount of the vaccine, as liquids, solids, granules or gelatin; (c) suspensions in an 
appropriate liquid; <d) suitable emulsions; and (e) polysaccharide polymers such as 
chitians. The vaccine, alone or in combination with other suitable components, may 
also be made into aerosol formulations to be administered via inhalation, e.g., to the 
bronchial passageways. Aerosol formulations can be placed into pressurized 
30 acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the 
like. 

Suitable formulations for rectal administration include, for example, 
suppositories, which consist of the vaccine with a suppository base. Suitable 
suppository bases include natural or synthetic triglycerides or paraffin hydrocarbons. 
35 In addition, it is also possible to use gelatin rectal capsules which consist of a 
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combination of the vaccine with a base, including, for example, liquid triglycerides, 
polyethylene glycols, and paraffin hydrocarbons. 

Each carrier must be "acceptable" in the sense of being compatible with the 
other ingredients of the formulation and not injurious to the recipient, e.g., the patient. 
5 The formulations can be presented in unit-dose or multi-dose sealed containers, such 
as ampules or vials and may be prepared by any method known in the art. 

Pharmaceutical compositions comprising any of the nucleic acid molecules 
encoding Ebola viral proteins of the present invention are useful to immunize a 
subject against disease caused by Ebola virus infection. Thus, this invention further 
10 provides methods of immunizing a subject against disease caused by Ebola virus 
infection, e.g., hemorrhagic fever, comprising administering to the subject an 
immunoeffective amount of a pharmaceutical composition of the invention. This 
subject may be an animal, for example a mammal, such as a primate or preferably 
a human. *. 

15 The vaccines of the present invention are also suitable for veterinary 

immunization. The vaccines of the present invention comprising nucleic acid 
molecules encoding Ebola virus structural -gene products from the Reston strain, 
which is known to infect animals, are particularly useful in such veterinary 
immunization methods. 

20 The vaccines are administered in a manner compatible with the -dosage 

formulation, and in such amount as will be therapeutically effective, immunogenic and 
protective. The quantity to be administered depends on the subject to be treated, 
including, for example, the capacity of the immune system of the individual to 
synthesize antibodies, and. if needed, to produce a cell-mediated immune response. 

25 Precise amounts of active ingredient required to be administered depend on the 
judgment of the practitioner and may be monitored on a patientby-patient basis. 
However, suitable dosage ranges are readily determinable by one skiHed in the art 
and generally range from about 300 fig to about 4-5 mg. The oosage may also 
depend, without limitation, on the route of administration, the patient's stateof health 

30 and weight, and the nature of the formulation. 

Methods of immunizing a subject against multiple strains of€bola virus are 
further provided herein. The nucleic acid molecules encoding Ebola viral proteins of 
the present invention may be combined with nucleic acid motecutes encoding other 
viral proteins of other virus strains to achieve protection against multiple strains of 
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Ebola vinjs. Typically the vaccines will be in an admixture and administered 
simultaneously, but may also be administered separately. 

In some instances it may be desirable to combine the Ebola virus vaccines of 
the present invention with vaccines which induce protective responses to other 
5 agents, particularly other viruses. For example, the vaccine compositions of the 
present invention can be administered simultaneously, separately or sequentially with 
other genetic immunization vaccines such as those for influenza (Ulmer, J.B. et al., 
Science 259:1745-1749 (1993); Raz, E. et al., PNAS (USA) 91:9519-9523 (1994)), 
malaria (Doolan, D.L. et al., J. Exp. Med. 183:1739-1746 (1996); Sedegah, M. et ah, 
10 PNAS (USA) 91:9866-9870 (1994)), and tuberculosis (Tascon, R.C. et al., Nat Med. 
2:888-892 (1996)). 

It will also be appreciated that single or multiple administrations of the vaccine 
compositions of the present invention may be carried out. For example, subjects who 
are particularly susceptible to Ebola virus infection may require multiple immunizations 
15 to establish and/or maintain protective immune responses. Levels of induced 
immunity can be monitored by measuring amounts of neutralizing secretory and 
serum antibodies, and dosages adjusted or vaccinations repeated as necessary to 
maintain desired levels of protection. 

This invention also provides kits comprising the vaccines of the present 
20 invention. For example, kits comprising a vaccine and instructions for use are within 
the scope of this invention. 

The vaccines and methods of the present invention evoke a protective immune 
response and do not lead to immunopotentiation or exacerbated disease. The 
vaccines lack transmissibility, are genetically stable and induce protective levels of 
25 humoral and cell-mediated immunity. 

In order to more fully demonstrate the advantages arising from the present 
invention, the following example is set forth. It is to be understood that the following 
is by way of example only and is not intended as a limitation on the scope of the 
invention. 

30 SPECIFIC EXAMPLE 

I. RESULTS 

Immune response to viral gene products in mice. To characterize immune 
responses to selected Ebola virus proteins, eukaryotic expression vector plasmids 
were injected into mice. The cytomegalovirus (CMV) immediate early region 1 
35 enhancer was used to stimulate transcription because it directs high levels of gene 
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expression in muscle. Manthorpe, M. et al., Hum. Gene. Ther. 4:419-431 (1993). 
cDNAs encoding an abundant structural protein, the major viral nucleocapsid 
phosphoprotein (NP), the secreted glycoprotein (sGP), or the membrane-associated 
glycoprotein (GP) were inserted. Alternative forms of GP were chosen because it had 
5 been postulated that the transmembrane protein contained a protein sequence motif 
also found in oncogenic retroviruses that might suppress immune responses. 
Burkreyev, A.A. et al., FEBS. Lett. 323:183-187 (1993); Cianciolo, G.J.etal.. Science 
230:453-455 (1985); Harris, D.T. et al., J. Immunol. 138:889-894 (1987); Volchkov, 
V.E. et al.. FEBS. Lett. 305:181-184 (1992); Sanchez, A.et al., Virus. Res. 29:215- 
10 240 (1993). Expression of the relevant proteins was confirmed after transfection of 
the human renal epithelial cell line, 293, by immunoblotting with antisera to these 
gene products (Fig. 1A). For NP, the expected full-length 104 kDa protein normally 
produced by the virus was seen, together with lower molecular weight species likely 
generated from truncated protein or degradation products described previously. 
1 5 Sanchez. A. et al., Virology 170:81-91 (1989). Similarly, expression of sGP and GP 
revealed a heterogeneous pattern whose sizes correlated with the expected products 
of cleavage or post-translational carbohydrate modification. Sanchez, A.et al., PNAS 
(USA) 93:3602-3607 (1996). 

These plasmids were injected into mice to characterize their ability to induce 
20 humoral and cellular immune responses to the relevant viral proteins. Three 
injections, each with 50 //g of plasmid DNA in saline (100 //I), were performed at two- 
week intervals in Balb/C female mice (6-8 week old, Charles River). -Serum from 
immunized recipients were examined for antibody responses. An antibody -response 
to the viral NP gene product was readily detectable <Fig. 18), with titers of £ 
25 1:16,000 by Western blot analysis. The titer of antibody induced in response to 
injection with plasmids encoding the viral glycoproteins was lower. After immunization 
with GP, no antibody was detectable by Western blotting, while immunization with 
sGP induced an antibody response (Fig. 1B). The more -sensitive -EUSA <Ksiazek, 
T.G., Lab. Anim. 20:34-46 (1991); Ksiazek, TX3.-et al., J.Clin. Microbiol. 30:947-950 
30 (1992)) did allow detection of antibodies to both 13P and sGP at titers of 1 :4O0 and 
1 : 1.200. respectively. Cytolytic T cell tCTL) responses to these viral proteins were 
analyzed i cext. Despite the substantial humoral immune response to NP. minimal-CTL 
activity was detected against syngeneic cells expressing this viral .protein {Fig. 2A). 
In contrast, genetic immunization with sGP. which elicited a weaker antibody 
35 response, induced a marked cytolytic T cell response tocells expressing -GP<Fig. 3B). 
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Immunization with the GP plasmid also induced a significant CTL response to GP 
(Fig. 2C). These data suggested that both the secreted and transmembrane form of 
the protein could be processed for antigen presentation and the transmembrane form 
was a target for recognition by these cytolytic T cells. Finally, antigen-specific T cell 
5 proliferation to sGP was also observed in GP and sGP but not plasmid control 
injected mice (Fig. 2D). 

The ability of antibodies detected in mouse sera after immunization to 
neutralize virus was tested in an in vitro infection assay. McCormick, J.B. et aL, J. 
Infect. Dis. 147:264-267 (1983). In no case was neutralization of infectivity observed, 
1 0 even at titers of 1 :1 0 (data not shown), despite the documented presence of antibody 
after NP and sGP immunization by Western blot analysis. Infectivity in vitro was thus 
not inhibited by Ebola-specific antibodies. 

Immune function and viral challenge in guinea pigs. To determine whether 
the in vivo immune responses could protect against viral infection, virus was adapted 
1 5 to grow in guinea pigs. Though this species is not well-suited to analysis of immune 
function, infection in adult mice has not been successful. Moreover, infection in 
guinea pigs, used originally to propagate virus from infected humans, is a well- 
established animal model for the human disease. Infection gives rise to a syndrome 
of hemorrhagic fever with levels of virus, organ pathology, and infection of 
20 reticuloendothelial and mononuclear cells comparable to humans. Bowen, E.T.W. et 
al., Lancet 1:571-573 (1977). 

Two groups of immunized guinea pigs were studied. Animals were injected 
intramuscularly with the relevant expression vector plasmids, and the response to 
infection in groups immunized with either sGP. GP, NP, or control plasmids was 
25 observed. In the first group, animals were challenged within 2 months after the initial 
immunization, at which time the antibody titers were high, ranging from 1:1,600 to 
>1:25,000 (Table 1A). In these animals, nearly complete protection from lethal 
challenge was observed in GP (6/6). sGP (5/6), and NP (4/4) immunized subjects, in 
contrast to controls <0/6). In a second group, guinea pigs were chaHenged four 
30 months after the initial immunization (Table 1B). As in the first group, all animals 
immunized with the control vector succumbed to infection within a week after virus 
challenge (n=4). In this group, antibody titers were lower, and three of the four 
guinea pigs immunized with NP succumbed to infection, with the single survivor 
appearing severely ill after 1 week, in contrast to the prot ctive response with NP at 
35 the earlier time point after immunization in Group I. More effective protection was 
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seen in animals immunized with vector expressing GP, protection was noted in four 
of five animals challenged, with one survivor appearing weak but surviving the viral 
challenge. Similarly, three of the five animals immunized with sGP showed no ill 
effects following viral challenge. Protection in this group again correlated with the 
5 ability to sustain an effective immune response to GP or sGP. Together, all guinea 
pigs immunized with vectors expressing GP or sGP which had titers greater than 
1 :5,120 were resistant to infection (11/11) compared to 0/10 controls <p=0, by Fisher's 
exact test). Twelve of fourteen animals with antibody titers 2s 2.560 survived viral 
challenge compared to controls <p=.00003, by Fisher's exact test). Similar to 

1 0 immunized mice, guinea pigs injected with GP or sGP plasmids were able to generate 
cell-mediated immune responses to the viral glycoprotein in addition to the antibody 
response. These responses were antigen-specific and T cell-dependent, as detected 
in sGP antigen-dependent spleen cell proliferation and T-cell growth factor assays 
(Fig. 3A-C). Thus, the ability to generate and sustain significant cellular immune 

15 responses in vivo correlated with protection from infection. Though antibody: titer 
correlated with protection, cell-mediated immunity appeared necessary for protection 
since passive transfer of antibody to GP does not confer protection (Peters, C.J. et 
al., Filoviridae: Marburg and Ebola Viruses, in Fields Virology, (eds.. Fields, B.N., 
Knipe, D.M.& Howley, P.M.) 1161-1176 (Philadelphia, Lippincott-Raven, 1996); 

20 Jahrling, P.B. et al., Arch. Virol. Suppl. 11:135-140 (1996)) and antisera from 
protected guinea pigs did not inhibit virus replication in vivo (n=3) or at a 1 :10 dilution 
in vitro (data not shown). Since the Hartley guinea pig to which the vinjs has been 
adapted for growth is outbred, cellular adoptive transfer studies could not be 
performed. 



TABLE 1 - Group I 



30 



35 



Plasmid 


Subject 


ELISAfPre) 


ELISACPost) Viral A £ 


Survival 


GP 


1 


>1:25,600 


1:12,800 


Yes 


GP 


2 


>1:25,600 


1:25,600 


Yes 


GP 


3 


>1 :25,600 


1 :25,600 


Yes 


GP 


4 


1:25,600 


1:6,400 


Yes 


GP 


5 


1:25,600 


1:12,800 


Yes 


GP 


6 


1:25,600 


1 :25,600 


Yes 


~~SGP 


1 


1:12,800 


1:25,600 


Yes 


SGP 


2 


1 56,400 


1 :25,600 


Yes 


SGP 


3 


1:6,400 


l:25,-600 


Yes 
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30 



SGP 


4 


1:25,600 


1:25,600 


- 


Yes 


SGP 


5 


>1:25,600 


1:12,800 


- 


Yes 


SGP 


6 


1:1,600 


Negative 


+ 


No 


Hp 


1 


1:12,800 


>1:25,600 


- 


Yes 


np 


2 


>1 :25,600 


1:25,600 


- 


Yes 


NP 


3 


1:12,800 


1:12,800 


- 


Yes 


NP 


4 


1:25,600 


1:25,600 


- 


Yes 


Vector alone 


1 


Negative 


Negative 


+ 


No 


Vector alone 


2 


Negative 


n.d. 


+ 


No 


Vector alone 


3 


Negative 


Negative 


+ 


No 


Vector alone 


4 


Negative 


Negative 


+ 


No 


Vector alone 


5 


Negative 


n.d. 


+ 


No 


Vector alone 


6 


Negative 


n.d. 


+ 


No 


Guinea pigs were immunized on days 1, 14, 28, 42, and challenged on day 62 






TABLE 1 


• Group II 






Plasmid 


Subject 


ELISAfPre^ 


ELISAfPostt 


Viral Ag 


Survival 


GP 


1 


1 :2,560 


n.d. 


+/f 


No 


GP 


2 


1:5,120 


1:10,240 




Yes 


GP 


3 


1:10,240 


1:10,240 




Yes 


GP 


4 


1:1,280 


n.d. 


+/f 


No 


GP 


5 


1:5,120 


1:20,480 




Yes (ill) 


SGP 


1 


1:2,560 


n.d. 


+ 


No 


SGP 


2 


1:10,240 


1:5,120 


+/f 


Yes 


SGP 


3 


1:10,240 


1:81,920 




Yes 


SGP 


4 


1:2,560 


1:5,120 




Yes 


SGP 


5 


1:640 


n.d. 


+ 


No 


NP 


1 


n.d. 


n.d. 


+ • 


No 


NP 


2 


n.d. 


n.d. 


+ 


No 


NP 


3 


n.d. 


n.d. 


+ 


No 


NP 


4 


n.d. 


Negative 


+ 


Yes (ill) 


Vector alone 


1 


Negative 


n.d. 


+ 


No 


Vector alone 


2 


Negative 


n.d. 


+ 


No 


Vector alone 


3 


Negative 


n.d. 


+ 


No 


Vector alone 


4 


Negative 


n.d. 


+ 


No 
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Guinea pigs were immunized on days 1, 14, 42, and 112 and challenged on 
day 122. 

n.d.=not done. Viral ag denotes presence of virus determined by 
immunohistochemistry (30) performed on spleen, liver, lung, kidney, and heart 
5 tissues; "+" = widespread systemic involvement of the mononuclear phagocyte 

system and to a lesser extent endothelial and parenchymal cells; "+/f* = focal 
involvement (seen in the spleen of SGP #2, the liver and spleen of GP #1 , and 
the lung of GP#4) where rare sites of anti-Ebola antibody staining were 
detected.; "-" = no Ebola virus antigen detected in tissues. 
10 ELISA determinations made prior to viral challenge (Pre) or at least 7 days 

after (Post) infection, respectively. 

The surviving NP immunized animal (4) was found to have significant levels 
of virus in major organs by immunohistochemistry, and more than 5 logs of 
virus was detected in the serum and spleen, in contrast to GP and sGP 
15 animals where no virus was detected. 

Histopathologic analysis of infection in immunized guinea pigs. 
Pathologic analysis revealed widespread tissue necrosis and dissemination of virus 
by immunohistochemistry, similar to human disease. Virus load correlated with 
susceptibility to infection with titers of 2s 10 5 in infected animals compared to 
20 undetectable levels in immunized survivors. In infected animals, the liver, lung, and 
spleen showed evidence of significant viral antigen by immunohistochemistry (Fig. 4, 
Table 1), and both reticuloendothelial and mononuclear phagotic involvement was 
observed. 

Determination of antibody response in animals which survived virus challenge 
25 revealed increases in the immune response to viral proteins when initial titers were 
lower (Table 1 ). Less consistent increases in antibody titers were observed in the MP 
immunized animals. These data suggest that Ebola virus infection may stimulate 
immunity in survivors of a viral challenge when immune responses are not optimal. 
II. METHODS 

30 Plasmids. Plasmids containing the GP, sGP, or NP cONAs .(Sanchez, A. et 

al., Virus. Res. 59:215-240 (1993), Genbank) were used to subclone the relevant 
inserts into CMV expression vectors which utilized the bovine growth hormone 
polyadenylation sequence. Manthorpe, M. et al., Hum. -Gene. Ther. 4:41 9-431 (1993). 
(see Figures 5-9 and SEQ IO NOS: 1^t). Briefly, for<3P, plasmid pGEM-3Zf(-H3P 

35 was digested with EooR I, treated with the Klenow fragment of €. ooli ONA 
polymerase, and digested with BamH I. The GP fragment was tlien inserted into the 
pCMV expression vector plasmid digested with 8amH I, Ktenow fragment and«gl II. 
For sGP, the plasmid pCRH-sGP was digested with €coY? I, treated with Ktenow 
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enzyme, and the resulting fragment inserted into the BamH l/Bgl II CMV plasmid 
which had been incubated with Klenow fragment, calf intestinal phosphatase (CIP), 
then phenol chloroform extract. For the NP expression vector, plasmid pSP64-NP2 
(Sanchez, A. et al., Virology 170:81-91 (1989)) was digested with EcoR I, treated with 
5 Klenow enzyme, and digested with BamH I. The NP insert was cloned into CMV 
treated with BamH I, Klenow enzyme, followed by heat inactivation and Bgl II 
digestion. 

Cell lines and transfectants. For stable transfectants. the relevant cDNAs 
were inserted into a CMV expression plasmid containing a neomycin resistant gene, 
10 pCMV-neo (H. Arai, unpublished data), which was digested with Xba I, and treated 
with CIP and Klenow enzyme. The EcoR I/ BamH I GP fragment from pGEM-3Zf(-)- 
GP, the EcoR I sGP fragment from pCRII-SGP, or the EcoR l/BamH I NP fragment 
from pSP64-NP2 was treated with Klenow enzyme and ligated to this plasmid 
backbone. These vectors were transfected into Renca or CT26 which was syngeneic 
1 5 to Balb/C mice using calcium phosphate and selected in 0.7 or 1mg/ml G418 for 2-6 
weeks. Expression of GP, sGP, or NP from these vectors in Renca or CT26 cells was 
also confirmed by Western blot analysis (data not shown). 

Cell proliferation assay. Spleen cells from male Hartley guinea pigs or 
Balb/C female mice (8-10 weeks) immunized with the indicated plasmid expression 
20 vectors were incubated with sGP or vector control supernatants (25% volume:volume) 
from transfected 293 cells at the indicated cell concentrations. T cell depletion was 
performed using the CT5 monoclonal antibody (Tan, B.T.G. et al., Hybridoma 4:1 15- 
124 (1985)) (Biosource, Camarillo, CA) for guinea pigs or anti-Thy 1.2 antibody in the 
mouse using immunomagnetic microbeads (Miltenyi Biotec. Inc., Auburn, CA). 
25 Viral challenge in guinea pigs. Animals were immunized by injection of 100 

P\ (0.5 mg/ml) in each hind leg (two injections at each time point) with the indicated 
Plasmid expression vectors. Animals were challenged by inoculation with a stock of 
Ebola virus (Zaire, 1976) that had been passaged once in vero E6 cells and serially 
passaged by intraperitoneal injection of spleen homogenates in Hartley guinea pigs 
30 seven times. Immunized guinea pigs were injected intraperitoneally with 0.5 ml of a 
1:1,000 dilution of spteencell homogenate in Hank's balanced salt solution 122 days 
after the initial plasmid ONA injection (1000 pfu). Survival was determined 10 days 
later at which times animals were sacrificed for serologic and pathologic analysis. 
ELISA, enzyme-linked immunosorbent assay (Volchkov, V.E. et al., FEBS. Lett 
35 305:181-184 (1992); Sanchez, A. et al., Virus. Res. 29:215-240 (1993)) on infected 
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cell supernatants and enriched viral extracts containing GP, sGP, or NP were 
performed as previously described. 
III. DISCUSSION 

Following the initial report that injection of plasmid DNA into muscle could 
5 direct the synthesis of recombinant proteins (Wolff, J.A. et al., Science 247:1465-1468 
(1990)), the suggestion was made that this gene transfer approach may be useful for 
vaccination and was termed genetic immunization. Tang, D.C. et al., Nature 356:1 52- 
1 54 (1 992). This approach has been applied to different infectious diseases, including 
influenza (Ulmer, J.B. -et al., Science 259:1745-1749 (1993); Raz, E.-et al., PNAS 
10 (USA) 91:9519-9523 (1994)), malaria (Doolan, O.L. et al.. J. Exp. Med. 183:1739- 
1746 (1 996); Sedegah, M. et al., PNAS(lfSA) 91 :9866-9670 (1994)), and tuberculosis 
(Tascon, R.C. et al., Nat. Med. 2:888-892 (1996)) and has also been used to 
modulate antibody and cell-mediated immune responses in autoimmune and allergic 
diseases. Raz, E. et al., PNAS (USA) 90:4523-4527 (1993); Waisman, A. et al., Nat. 
15 Med. 2:899-905 <1996); McCormick, J.B. et al., J. Infect. Dis. 147:264-267 < 1963); 
Border, W.A. et al., Nat. Med. 1:1000-1001 (1995). 

The immune response to selected Ebola virus proteins after genetic 
immunization in mice was analyzed and their ability to protect against lethal infection 
in a susceptible animal model, the guinea pig, was tested. The immune analyses 
20 performed in different species suggest similar patterns of response, though the 
specific peptides which may be recognized by the immune -system to confer protection 
in the guinea pig could differ from the mouse. Because the principlesof MHC antigen 
presentation and recognition apply broadly across species (Monaco, J.J., Immunol. 
Today 13:173-179 (1992); Jorgensen. J.L. et al., Annu. Rev. Immunol. 10:835-873 
25 (1992); Zinkemagel, R.M. et al., Immunol. Today 18:14-17 (1997)), the finding thai 
protection was observed in different members of an outbred strain and that similar 
immune responses were seen in different species is not unexpected and suggests 
that this approach may be applicable to humans. 

Immunization with plasmids encoding distinct viral proteins induced different 
30 antibody and cytolytic T cell responses. The broadest immune -response was 
conferred byGP and sGP, which induced both -cellular and humoral immunity to the 
membrane-associated GP. In guinea -pigs challenged with doses of virus -that are 
otherwise lethal, sGP-provicted nearly equivalent protection to GP, wkhoo significant 
difference between these groups. The ability of vectors expressing GP to oonfer 
35 immunity may be explained by the generation of lower molecular weight xiegradation 
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products (Fig. 1B) which could provide sufficient protein for antigen presentation to 
.nduce detectable, cellular, and humoral immune responses in guinea pigs. 

Despite the fact that plasmid DNA injection has been shown to affect the 
.mmune response to different antigens in infectious and autoimmune diseases the 
5 ab,l,ty of individual gene products to protect against disease in vivo is not readily 
pred,ctable. In particular, the rapid rates of Ebola virus replication and the poor 
•mmunogenicity of its proteins had previously rendered it resistant to immune 
mterventions. Several attempts to confer protection with passive transfer of 
.mmunoglobulin were unsuccessful (Peters, C.J. etal., Filoviridae: Marburg and Ebola 
10 V,ruses. in Fields Virology, (eds., Fields, B.N., Knipe, D.M.& Howley. P.M ) 1161-1 176 
(Philadelphia. Lippincott-Raven. 1996); Jahrling. P.B. etal.. Arch. Virol. Suppl 11135- 
140 (1 996)). in agreement with the finding set forth herein that antisera from protected 
an,mals fails to neutralize virus replication in vitro. Previous studies using formalin- 
f.xed virus or purified viral proteins for immunization have also not proven effective 
15 Peters, C.J. et al., Filoviridae: Marburg and Ebola Viruses, in Fields Virology, (eds 

F. elds, B.N., Knipe, D.M.& Howley. P.M.) 1161-1176 (Philadelphia. Lippincott-Raven' 
1 996); Clegg. J.C.S. & Sanchez, A. Vaccines against arenaviruses and filoviruses in 
New Generation Vaccines, (eds., Levine, M.M.. Woodrow, G.C., Kaper, J.B.& Cobon. 

G. S.) 749-765 (New York, NY, Marcel Dekker, Inc. 1997). 

20 It is likely that traditional immunization approaches using protein antigens 

vaccmia virus, or inactivated virus do not allow for appropriate uptake and 
presentat.cn of viral antigens by dendritic or other antigen-presenting cells to induce 
protectee immune responses. It has been shown recently that genetic immunization 
leads to production of recombinant protein(s) in muscle which are delivered to bone 
marrow-derived antigen-presenting cells. Iwasaki, A. et al., J. Immunol. 15911-14 
(1 997); Doe. B. et al., PNAS (USA) 93:8578-8583 (1 996); Corr, M. et al.. J. Exp Med. 
164:1555-1560(1996). Synthesis of Ebola glycoprotein after gene transfer apparently 
allows more efficient processing and presentation and the generation of immune 
responses not seen with virus or with viral vectors. GP is a large molecule which 
conta.ns both T and B cell epitopes. Although antibody levels provide a surrogate 
marker of protection, the fact that passive transfer of antibody did not confer 
protection implies that immunoglobulin switching and synthesis is reflective of the T 
helper response to GP. Genetic immunization stimulates T helper cells to generate 
both CTL and B cel. antibody responses to the virus. Although antibody production 
confirms effective immunization, a productive T cell response, likely involving T H 1 cell 
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stimulation, as shown by the T cell proliferation and CTL assays <Fig. 3), is needed 
for effective immunity. Taken together, these studies suggest that transcription and 
translation of viral genes in host cells by genetic immunization induces alternative, 
more effective, processing and antigen presentation which better stimulates immunity 
5 to Ebola virus. Since there are yet no effective antiviral agents, the ability to generate 
protective immunity by vaccination may prove useful in selected high risk populations, 
particularly in regions of ongoing outbreaks, and among medical and laboratory 
personnel exposed to the virus. Although it remains important to identify agents 
which treat acute infection, genetic immunization may help to limit the spread of this 

10 highly lethal infectious disease. 

The foregoing discussion discloses and describes merely exemplary 
embodiments of the present invention. One skilled in the art will readily recognize 
from such discussion, and from the accompanying drawings and claims, that various 
changes, modifications and variations can be made therein without departing from the 

15 spirit and scope of the invention as defined in the following claims. 

All references cited herein are incorporated by reference as if fully set forth. 
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WE CLAIM: 

1. A pharmaceutical composition comprising a nucleic acid molecule 
encoding an Ebola virus structural gene product operatively-liked to a control 
sequence, in a pharmaceutically acceptable earner. 

2. The pharmaceutical composition of Claim 1, wherein the Ebola virus 
structural gene product is selected from the group consisting of the transmembrane 
form of virus glycoprotein, the secreted form of virus glycoprotein, virus nucleoprotein 
and combinations thereof. 



3. The pharmaceutical composition of Claim 1, wherein the control 
10 sequence is a promoter. 

4. The pharmaceutical composition of Claim 3, wherein the promoter is 
the CMV immediate-early region 1 promoter. 

5. The pharmaceutical composition of Claim 1, further comprising an 
adjuvant. 

1 5 6 - Tne Pharmaceutical composition of Claim 2, wherein the structural gene 

product is the transmembrane form of vims glycoprotein. 

7. The pharmaceutical composition of Claim 2, wherein the structural gene 
product is the secreted form of virus glycoprotein. 

8. The pharmaceutical composition of Claim2, wherein thestructuralgene 
20 product is virus nucleoprotein. 
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9. A method of producing a vaccine against disease caused by infection 
by Ebola virus, comprising the steps of: 

a) administering the pharmaceutical composition of Claim 1 to a test host 
to determine an amount and a frequency of administration thereof to elicit a protective 

5 immune response in said host; and 

b) formulating said pharmaceutical composition in a form suitable for 
administration to a treatable host in accordance with said determined amount and 
frequency of administration. 



10. A vaccine comprising a nucleic acid molecule -encoding the 
10 transmembrane form of the Ebola virus glycoprotein operatively-linked to a control 

sequence, in a pharmaceutical^ acceptable carrier. 

1 1 . The vaccine of Claim 10, wherein the control sequence is a promoter. 

1 2. The vaccine of Claim 1 1 , wherein the promoter is the CMV immediate- 
early region 1 promoter. 

15 13 - The vaccine of Claim 10, further comprising an adjuvant. 

14. A vaccine comprising a nucleic acid molecule encoding the secreted 
form of the Ebola virus glycoprotein operatively-linked to a control sequence, in a 
pharmaceutical^ acceptable carrier. 

15. The vaccine of Claim 14, wherein the control sequence is a promoter. 

20 16 - The vaccine of Claim 15, wherein the promoter is the CMV immediate- 

early region 1 promoter. 

17. The vaccine of Claim 14, further comprising an adjuvant. 



1S. A vaccine comprising a nuoteic acid molecule encoding the €tola virus 
nucleoprotein operatively-linked to a control sequence, in a pharmaceutical^ 
25 acceptable carrier. 
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The vaccine of Claim 18, wherein the control sequence is a promoter. 



20. The vaccine of Claim 1 9, wherein the promoter is the CMV immediate- 
early region 1 promoter. 

21. The vaccine of Claim 18, further comprising an adjuvant. 

5 22. A method of immunizing a subject against hemorrhagic fever 

comprising the step of administering to the host an immunoeffective amount of the 
vaccine of any of Claims 10 to 21 . 

23. The method of Claim 22, wherein the hemorrhagic fever is caused by 
infection with Ebola virus. 

10 24 • The method of Claim 22, wherein the hemorrhagic fever is caused by 

infection with Marburg virus. 

25. The method of Claim 22, wherein the host is a human and 
administration is by intramuscular injection. 

26. The method of Claim 22, wherein the subject receives a second 
1 5 administration of an immunoeffective amount of a vaccine against disease caused by 

infection by Ebola virus or Marburg virus. 
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<3en©ral Description 0 ^ 

DKA pVR 1012-GPCIC) 
Local object 

Created: 09/14/98 04:17PM 
Last Modification Date: ? (no data) 
length: 7003 bp 
storage type: Basic 
torn: Circular 
Cements 

Restriction Map 

BglllMsite 

Clal: 1 she **gg** 
Drain: 1 site gSggffiS 
EcoRVMsite |£™Eg 
HindMI: 1 site 
Hpal: 1 site 



c. 



KasMsHe 
Kpnl:lsite 

NarlMsite grace 

PmllMsite <gg(ijg 
Pstl: 1 site 

PvuMsite ggggg 

Sacll: 1 site gg§g*f 

Sail: 1 site g*gu»c 

XmnMsite g^SSS 
EcoRI:2sites g***g 
Ncol:2sites 
Ndel:2sites §g»*° 
Sphl:2sites ggggg 
Xhol:2sites <gggg 
BamHI:3sites g|*gg 
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Bell: 3 sites 

Functional Map 
CDS {4 signals) 
CMV IE 5' UT 

Start: B66 End: 1129 

CMV IE INT 

Starts 1130 End? 1840 
TbGH 

Start* 4020 Endj 4572 

Kanr 

Start: 6068 Endi 6690 (Complementary) 
Miscjeature (2 signals) 
CMV enhancer 

Stare: 248 Endx 885 
GP(IC) 

Start: 1870 Enflt 4019 

Annotations 
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1 CCGCCCGTTT CCCTGATGAC GGTGAAAACC TCTOACACAT CCAGCTCCCG 
AGCGCGCAAA GCCACTACTC CCACTTTTGO AGACTGTGTA CGTCGACOGC 



51 GAGACCOTCA CAGCTTCTCT GTAAGCGSAT GCCGGOAGCA GACAACOCCC 
CTCTGCCAGT GTCCAACAGA CATTCCCCTA CGGOOCTCGT -CTGTTOGGGC 



101 TCAGGGCGCG TCAGCGGGTG TTGCCGGGTC TCGGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCCCOCAC AGCCCCGACC CAATTGATAC 



KdeX 



151 CGCCATCAGA GCACATTCTA CTGAGAGTCC ACCATATGCG GTGTCAAATA 
GCCGTAG7C? CCTCTAACA7 CAC1CTCACG TGCTATACGC CACACTTTAT 



201 CCGCACAGAT CCCTAAGGAG AAAATAOCGC ATCAGATTGG CTAT7GGCCA 
GGCGTG7C7A CGCATTCCTC TTTTATGGCG TACTCTAACC GATAACCGGT 



251 TTGCATACGT TCTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 
AACGTATGCA ACATAGGTAT AGTATTATAC ATGTAAATAT AAOGGAGTAC 



301 TCCAACATTA CCGCCATGT? GACATTGATT ATTGACTAG? TATTAATAGT 
AGG7TGTAAT GGCGGTACAA C-CTAACTAA TAACTGATCA ATAATT&TCA 



351 AA7CAATTAC GGGGTCATTA GTTCACAGCC CATATATCCA GTDCCGCGTT 
TTAGTTAATG CCCCAGTAAT CAAG7ATCGG CTATATACCT CAAGGCGCAA 



401 ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCCCCCA ACGAGCCCCG 
7GTATTGAA7 GCCA7TTACC GGGCGGACCC ACTGGCGGGT TGCTGGGGGC 



451 CCCATTGACG TCAATAATCA CGTATGTTCC CATAGTAACG CCAATAGGGA 
GGGTAACTGC AGT7A7TACT GCATACAAGG CTATCATTGC GGTTA7CCC? 



501 CtTTCCATTG ACGTCAATGG CTGGA3TATT TACGGTAAAC TGCCCACTTG 
GAAAGGTAAC TGCAGTTACC CACCTCATAA ATGCCATTTG ACGGGTGAAC 



551 CCAGTACATC AAGTGTA1CA TATCCCAAGT ACGCCCCCTA TTGACGTCAA 
CGTCArCTAO TTCACATACT A7ACGGT2CA TCCCCGGGAT AACTCCACTT 



601 7GACGGTAAA TGGCCCGCCT GGCATTA7GC CCAGTACATG AOCTTATCCC 
ACTGCCATTT ACCGGGC£QA CCGTAATACC CGTCATGTAC TGGAATACCC 



Ncol 

€51 ACTTTCCTAC TTGGCAoiAC A^TACGTAT TAG3CATCGC TATTACCATC 
TGAAAGGATG AACCGTCATG TAGATGCATA ATCAG7AGCC ATAATGGTAC 



KeoX 

701 GCGATGCGGT 7TTGGCA6TA CATCAATGGG CGTGGAYACC -GGTTTGACTC 
CACTACGCCA AAACCG7CAST GTAGTTACCC GCACCj; TCG CCAAACTGfcG 



751 ACGGGGATT? -CCAAGTCTCC ACCCCATTGA CC7CAATGGG AGTTTGTTTT 
TGCCCCTAAA GC7TCAGAGG TGGGCSAACT GCAGTTACCC 7CAAACAAAA 



-601 GGC^CCAAAA SWCGOPAC TCTCCA* <nCGTAfiCAA CfOCCGGCCA 
CCCTCCTTTC AGCTv'X'Ca^ AAACCiT^AA CAGCAWCTT GAGGCGGGGT 
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851 TTSiSSS*^ WeOCQOTAC GCGTGTACGG 7GGGAGG5CT ATATAAOCAB 

.... .^f!T: ^!. J ! cc f??!? c ssSK 

SOI AGCTCGTT7A GTCAACCOTC AGATCGCCTC OtGACOCCAF ce*eft~r^U 

. . . . . TCSAGCWT c^™^ ^agcggaS cSSaS£ £raoS 

SacXl 

851 rTSfffT?" TACXACA CAC C6GGACC6AT CCAGCCtCCG CCGCCGCGAA 
. > A f CT00AC37 ATCWCWW 6CCCWCCTA GG5CGGAGGC CCCGGCCCTT 

1001 ££?22£?2 OAACGCGGAT TCCOCCTCCC AAGAGTCACC TAAGTACCCC ' 
. ^^^^^^ CTTGCGCCTA AGGGCCACGG TTCTCACTCC ATTCATGGCG 

1051 CTATAGACTC TATAGGCACA CCCCTTTGGC 7CTTA7GCAT GCTXTJWrw 
GATA7CTGAG ATATCCGTGT GGGGAAACCG SaSJa S 



1101 TrrrCGCTTG CGCCCTATAC ACCCCCGCTT CCTATGC-A Tir.r«rV»w 
.... CCCGGATATG TCCGGgSgaJ W ItcWCTA^ 

1151 ?ATAGCTTAG CCTATAGGTG TGCGTTATTG ACCATTATTG ACCXMCtfee 
ATATCGAATC GGA7ArcCAC ACCCAATAAC Sc S 



• . • . 



••••••••••••• 

1201 TAT?< 



_ SSS S S S23S 

1301 ACGGACrCTG TATTTTTACA GGATGGCGTC CCATTTATTA TWAe!*'*™ 

.... .7f?! < ! fi f AC /;?^ TGT "*» GGTAaItaIt 

13S1 AO^OCCST CCCCCCTCCC CGOWTTTTX A^AAACATA 

.... . ™* °?.??J?° aoc * ^gtcaaIaa 

1401 GCCTCCGArc TCCACCCCAA CC7CGCGTAC GTGTTCCGGA CATGGGCTC" 
CCCACCCTAG ACGTGCGCTT ACAGCCCATG CACAACCCCT GTAcSa 

14S1 TCTCCGGTAG CCGCGGAGCT TCCACATCCG ACCCCTCGTC Oitaiwr 
. . . . . ^.^f^" AGGTCTAGGC TcScCX^ CmLS 

1501 AGCCCCTCAT GGTCGCJCGG CAGCTXXTTC CTCCTAACAe mruv^ll 

.... . TC f? < : GAGTA .^*f^«f «S 

1S " ^n"^^ JSSSli* 00 «»«*<wc cmkmooo "ca^aagUwo 

ttMKCGTG TCGTGTXACC GGTGGTGGTC CTCACACGOC CTCTTCCGGC 
1601 TCGCGGTAGC CTAHOWJTCT GAAAATGAGc" G7GCXCATTG GGCreii^ 

.... . . ^.^.?f Aaxwai S 

1651 cSSKSS^SE^ 

.... TACCTTCTCA ATTCCCTCCC CGI CTOCT 1 C TACGTCCCIC 

1701 C?GAGTTCTT GTATTCTGAT AAGAGTCACA CGTAACTCCe Cirvrrj^ 

GACSCAACAA CATAACAC7A PTCTCAG3CT CCATTCACGG CAACGGCACO '. . 



• • • « 
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1751 TGTTAACGGT GGAGCGCAGT GTACTCTGAG CACTACTCG? TGCTGOCCCC 
ACAATTGCCA CCTCCCGTCA CATCAGACTC CTCATGAGCA ACGACCGCGC 



NCOI 

1801 CGCGCCAOCA GACATAATAG CTGACAGACT AACAGACTGT TCCTTTCCAT 
CCCCCCTGGT CTCTATTATC GACTGTCTGA TTG7CTCACA AGGAAAGCTA 



Sail 

ucot F«tz pmii bcii gcenv 

1851 GGGTCTTTTC TGCAGTCACC GTCGTCGACA CGTGTGATCA CATATCGCGG 
CCCA6AAAAG ACGTCACTGG CAGCAGCTGT GCACACTAGT CCATAGCGCC 



BCORI 

IS 01 CCGGCCGGCC GCTCTAGAAT TCTCTAATCA CAGTCMCAT GSGAGGGTCA 
C2CGCGCCCG CCAGATCTTA ACAGATTAGT CTCAGTAdGTA CCC7CGCA6T 



L951 GCGATTCTGC AATTGCCGCG TGAGCGCTTC AOGAAAACAT CTTTCWTCT 
CCCTAACACG TTAACGGGGC ACTCGCCAAG TCCTTTTCTA GAAAGAAACA 



2001 TTGGGTAATA ATCCTATTCC ATAAAGTCTT TTCAATCCCG 7TGGGGGTTG 
AACCCATTAT TAGGATAAGG ATTTCACAA AAGTTAGGGC AAGOOCCAAC 



2051 TACACAACAA TACCCTACAA GTGAGTGATA TOGACAAGTT TCTGTGCCGA 
ATGTGTTCTT A7GGGATCTT CACTCACTAT AACTGTOCAA ACACACGGC? 



2101 CACAAACTCT CTTCAACTAG GCAATTGAAC TCAGTOGGGT TCAAC7TGGA 
CTGTTTGAGA GAAGTTGATC <3GTTAACTTC AGTCACCCCA ACTTCAAOCT 



2151 GCGCAATGGA CTVSCMCSO ATCTACCAAC -GGCAACCAAA AGAMGCGTT 
CCCGTTACCT CATCGTTGAC TACATCGTTC -CCGTTGGTTT TCTACCCCAA 



2201 TTCGAGCTGG TGTTCCACCA AAGGTGGTAA AT7ACGAAGC CtfGAGAATGG 
AAGCTCCACC ACAAGCTGGT TTCCACCATT TAATGCTTCG ACCTCTTACC 



2251 GCTGAGAACT GTTATAACCT GGCTATAAAG AAAGCTGATG -GTAGTCAGTG 
CGAC7C7TGA CAATATTGGA CCGATATTTC TTJCAACTAC CATCACTCAC 



2301 CCTACCAGAA GCCOnXSAGG CAGTCAG&GA TT75CGC0GT TOCCCCTATG 
GGATGGTCTT CGCGGACTOC <^CACTGCCT AAAAGGGGCA ACGGCGATAC 



2351 TACACAAAGT CTCACGAACT GGACCATOGC CAGGAGCACT -CQCCCTTCAC 
ATOTGTTTCA GA GTOC TTQA, CCTGGTACGG <*TCCTC C TGA> GCGGAAAGTG 



2401 AAAGAAGGAG CCTTCWCCT <3TAMACC<Sa QSOCATCAA CAATCATTTA 
GGAAGAAOGA CATACTCGCT GAGCGTAMT GTTAGTAAAT 



2451 TCGGGGTACA ACCTFiCCCQ AAGGAGT^AT SaCATOTCTC ATCTTGCCTA 
ACCCCCATGT TGGAAACGGC TTCCTCAATA ACCrAAACAC TftGAAGGCAT 



2501 AGGCGCCAAX GGATTTTTTC CAGTCTCCTC CATTGCATGA GC2CTCCCAAC 
TCt^CCTTT OCTAAAAAAG tJTCAGAGCK* -GEAACGWCT 0GGACGGT7G 
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BamJU 



2551 ATGACCACGG ATCCCTCCAG TTACTATCAC ACGACAACAA TAAACTACGT 
TACTGCTCCC TAGGGAGGTC AATGATAGTG TGCTG7TCTT ATTTGATGCA 



2501 GGTTGATAAT TTTOGAACCA ACACCACAGA GTTTCTCTTC CAAGTCGATC 
CCAACTATTA AAACCTTGGT TGTGCTOTCT CAAAOACAAG OTTCAGCTAfi 



Xhol 



2651 ATTTGACGTA TGTGCAGCTC GACCCAAGAT TCACACCACA ATTCCTTGTC 
TAAACTCCAT ACACGTCGAG CTCCGTTCTA AGTCTCGTGT TAAGGAACAG 



2701 CTCCTAAATC AAACCATCTA CTC7CATAAC CGCAGAAGTA ACACAACAGG 
GAGGA7TTAC TT7GCTACAT GAGACTATTG GCGTCTTCAT 7GTGTTCTCC 



2751 AAAACTAATC TGGAAAATAA ATCCCACT6T TCATACCAGC A7GGGTGAC7 
TTTTGAT7AG ACCTTTTATT 7AGCCTCACA AC7A7GGTCG TACCCACTCA 



2B01 CCCCTTTC7G GGAAAATAAA AAAACTTCAC AAAAACCC7T TCAAGTGAAG 
CCCGAAAGAC CCTTTTATTT TTTTGAAG7G T77TXCGCAA AGT7CACTTC 



2851 AGTTGTC7TT CCTACCTGTA CCAGAAACCC AGAACCAGGT CCTTGACACG 
TCAACAGAAA GCATGGACAT GC7CTT7GGG TCTTGCTCCA GGAAC7GTGC 



25 01 ACASCGACGG TCTCTCCTCC CA7CTCCGCC CACAACCACG CAGGCGAAGA 
7G7CGCTCCC AGAGAGGAGG GTAGAGGCGG G7GT7GGTGC GTCCGCWCT 



2951 CCACAAAGAA TTGGTTTCAG AGGATTCCAC TCCAGTGG77 CAGATGCAAA 
CGTGTTTCTT AACCAAAG7C 7CCTAAGGTG AGGTCACCAA GTCTACGTTT 



3001 ACATCAACGG AAAGGACACA ATGCCAACCA CAGTGACGGG 7C7ACCAACA 
7GTACT7CCC TTTCCTGTGT 7ACCC77GGT GTCACTGCCC ACATGGTTGT 



Bell 



3051 ACCACACCCT CTCCATTTCC AATCAATGCT CGCAACACTG ATCATACCAA 
TGGtCTCGGA GAGGTAAAGG 77AG7TACCA GCGTTGTGAC TACTATGGTT 



3101 ATCATTTA7C -CGCCTGGAGG CGCCCCAAOA AGACCACACC ACCACACAGC 
TAGTAAATAG CCGGACC7CC CCGGGGT7CT 7C7CGTGTCG 7GG7C7CTCG 



3151 CTCCCAAGAC CACCAGCC&A CCAACCAACA GCACACAATC GACGACACTA 
GACGGTTCTC GTGGTCGGTT GGTTGCfTGT CCTGTCTTAG CTGCTGTGAT 



3201 AACCCAACAT CAGAGCCCTC CAGTAGAGGC ACGGQACCAT CCAGCCCCAC 
TTCCG77GTA GTCTCGGGAG GTCATCTCCG TGCCCTGGtA GGTCGGGGTC 



3251 GGTCCCCAAC ACCACAGAAA CCCACGCCGA ACTTCGCAAG ACAACCCCAA 
CCAGGGGTTG TGGTGTCTTT CGCTGCOOCT TGAACC^IC TGTTGSGCW 



3301 CCACACTCCC AGAACA0CAC ACTGCCGOCA C1CCCAOTCC AAGAGCCCTO 
GGTGTCAGGG TCTTGTCGTG TGACGGCGGT CACGGTAAGG TTCICGGCAC 



3351 CACCCCGAC6 AACTCAGTGG ACCTGOCTTC C7GACCAACA CAATACGGGG 
GIGGGCCTGC TTGAGTCACC TGGACCGAAG CACT GCT TGT GTTA7GCCCC 
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BaaRX 

3401 GGTQACAAAT CTCCTOACAG GATCCACAAG AAAQCGAAGG GATCTCACTC 
CCACTCTTTA GAGGACTCTC CTAGGTCT2C TXTCGCTTCC -CTACAGTCAO 



3451 -CCAATACACA ACCCAAATGC AACCCAAACC TGCACTATTG GACAGCCTTG 
CCTTATGTGT TGGCTTTACG TTCO CTTTGO ACCTCATAAC CTOTCOQAAC 



3501 GATGAGGGTO CTOCCATAGG TTTAOCCTOG ATACCATACT TCGOOCCAGC 
CTACTCCCAC CACGGTATCC AAATCOCACC TATGGTATGA AGCCCGC3CC 



3551 AGCTGAGOCA ATTTACACTC AAGGCATAAT CCASAA5CAA AATGGATMA 
TCGACTCCCT TAAATGTGAC TTCCCTATTA CC1CTTAGTT TTACCTAACT 



3601 TCTGTGGATT GAGGCAGCTG GCCAACCAAA CGACACAAGC TCTTCAATTC 
AGACACCTAA CTCCGTCGAC CCCTTCCTTT GCTGTGTTCC AGAACTTAAC 



3651 TTCTTAACCG CAACTACTGA C7TGCCTACA TTCTCTATAC TAAATCGCAA 
AAGAATTCCC -GV7GA7GAC? CAACGCATST AAGAGATATG ATTTAGGCTT 



3701 AGCAATACAC TTCTTGCTCC AAAGATCGGG AGGAACATCT CACATTCTAG 
TCCTTATCTG AAGAACGACG TTTCTACCCC TCCT1CTACA CTGTAAGATC 



3751 <xk:ctcattg togcattcaa ccccaagatt ogaccaaaaa tatcact-gat 
cccgactaac aacctaactt ggggttctaa cctccctttt atag7gacta 



Bdl 

3801 AAAA TTCATC AAATAATCCA 7CACTTTCTC GATAATAATC TICCAAASCA 
TTTTAACTAG TTTATTACCT ACTGAAACAG CTATTATTAG AAGGTTTAGT 



3851 GAA7CATCGC AGCAACTGCT CGACTGGATG GAAACAATGG GTTCCTOC3G 
CTTACTACCG TCCTTGACCA CCTGACCTAC -CTCTGT7ACC CAAGGACCAC 



3901. CAA7AGGAAT CACACGAGTA ATCATTGCTA TTATTCCT7T GCTCT+W.TT 
CTi ATCCTTA tSTGTCCTCAT TXC I AACGAT AATAACrAAA CCATAOSTAA 



EcoRZ 

3S51 TCCAAATTCA TGCTTTGA>JC 7AATA7ACCA TCATACTTTA CAATTCTAGA 
ACG7TTAACT ACGAAACTTG A77ATATCGT ACTATCAAAT C77AAGATCT 



Narl 



€001 CCAGGCCOCT GOATCCAGA* -CTCCTOTGCC TTCJAGiTCC CA00CA2CTO 
COfCCGCOGA CCTAGGTCTA GACGAC\CCP AAGA9CAACO CTCOGTAfcfcC 



-4051 TTOTTTGGCC C7CCCGCGTG CCTlCCrtGA CCCTCGAAGO TOCCACTCCC 
AACAAACCGG GAOGGGGChC OCAACGAACT -GGGAGCTSCC AGCGMAGGG 



4101 ACTC T C e rrr cctaataaaa tcacgaaatt ccakccact cictcactag 

7GACAGGAAA tHftRASST?- ACTCCTTTAA CCTACCCTAA <CAGACTCATC 



4151 GTG7CATTCT ATTCTGGGG6 -GTGGGSTGGG -GCAGCACAGC AAGGGGGACG 

C&CACTAAG* TXi AMOCOC -CADCCdJCCe ~QGTCG&&CG TTOG00CTUC 



>: <WO 0932147A1 I > 



WO 99/32147 

o PCT/US98/27364 



" M Ti2SS £££££ «««««« AI6CGGTGGC CTCTATGG^ 
- - - . .Tfffi?!?. emrc0BCC «*C»CCCC TACGCCACCC GAGATACCCA 



Kjpai 



~ .™ KHB sss a 

~ srassss sssas 

" s :..~j ess c^s'^»si»' ^™ 

. sss 33SS SSS5 B sssss ' 



Xsnl 



«51 TOWACAAAT CATA5AATTT CTTCCGCTTC CTCGCSCACT CACSCGCGC 
AC3KCTTTA CTATCTTAAA CAAGGCGAAG GACCCAGTGA CTGMCGaS 



• • • • < 



"1= sss a ass sss 
~. sss.sssss sss : s» ais 

'!..^~ : ==ss'' 
k'^s'ssss'sss" 
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5051 


ACTATCCTCT TGAGTCCAXC CCGG7AAOAC ACGACTTATC GCCACTCGCA 
TGATAGCAGA ACTCACCTTG GGCCATTCTG 7GCTGAATAC CCGTQACOGT 




5101 


GCACCCACTG GTAACAGGAT TAGCACACCG AGGTATGTAC GCGG7GCTAC 
CCTCGGTGAC CAT7G7CCTA ATCGTCTCGC TCCATACATC C6CCACGATG 




5151 


ACAC7TCTTG AACTGG7GGC CTAACTACQG CTACACTACA AGGACAGTAT 
7C7CAAGAAC TTCACCXCCG GATTGATGCC CATC7GA7C7 TCCTSTCATA 




5201 


TTGGTATCTG CGCTCTGCTG AAGCCAOTTA CCTTCGGAAA AAGAGTTCGT 
AACCXTAGAC CCCACACfiaC TTCGG7CAAT CGAAGCCTTT TTCTCAACCA 




5251 


TCGAGAACTA CGCCGMTUT 7TGG7GGCGA CCATCGCCAC -CAAAAAAACA 




5301 


TTGCAAGCAG CACATTACGC GCACAAAAAA AGGATCTCAA GAAGATCCTT 
AACGTTCCTC CTCTAATGCG CCTCTTT77T TCCTACAGTT CTTCTAGGAA 




5351 


TC&7C7TT7C TACCGGGTCT GACCCTCAGT GGAACGAAAA CTCACG7TAA 
AC2AGAAAAC ATGCCCCAGA CTGCGAGTCA -CCTTGCTTTT GAGTGCAATT 




5*01 


GGCATTTTGG TCATGA6AT7 ATCAAAAACG A7CTTCACC7 ACATCCTTTT 
CCC7AAAACC AG7AC7CTAA 7AGTTTTTCC 7AGAAGTCGA 7CTAGGAAAA 




5451 


AAA77AAAAA 7CAAGT7T7A AATCAATCTA AAGTA7A7AT CAGTAAAC7T 
TTTAATTTTT ACTTCAAAAT TTAGT7AGA7 77CA7ATA7A CTCAT7TCAA 




5501 


CGTCTGACAC TTACCAATGC TTAATCAG7G AGGCACCTAT CTCAGCGATC 
CCAGAC7CTC AATGGTXACG AATTAGTCAC TCCGTGGATA GAC7CGC7AG 




5551 


7G7C7A7TTC GTTCATCCAT AGTTGCCTGA CTCCGGGGGG CGGGGCCCCT 
ACAGATAAAG CAAGTAGSTA TCAACGGACT CAGGCCOCCC CCCCCCGCGA 




5601 


-GAGG7C7GCC TCC-7CAACAA GG7GTTGCTG ACTCA7ACCA CCCCTGAATC 
C^CCAGACGG AGCACTTCT7 CCACAACGAC 7GAG7A7GG7 CCGGACTTAG 




5651 


CCCCCATCAT CCAGCCAGAA AG7GAAGGAG CCACCCTTGA TGAGAGCTTT 
CGGGGTACTA GGTCGCtfCTT TCACitXCt?C -GGTGCCAAC? AC1C7CGAAA 




5701 


GT7G7ACGTG GACCAGTTOG TGA7TTTCAA CTTTTGCTTT GCCACGGAAC 
<*ACATCCAC C7GGTCAACC ACTAAAACTT GAAAACGAAA CGGTGCCTTG 




-57S1 






<3GTCTGCGT7 -GiCCGGAAGA 2GCCKtt*3T -GATCCT^CAA CTCAGCAAAA 
CC'XGACGCAA CAGCCCT7CT ACGCAC7MA Cv ACGAAGTT GAGTCGTTTT 




5801 


Ci-JTCGATTTA 7TCAACAAAG CCGCCGVvXC ettCA»;GTCAG CGTAATGCTC 
CAACC7AAA7 AAC7TG7T7C GGCCQCACGG <^XJ7ZCACTC -GCATTACGAG 




5051 


YGCCAGTCTT ACAACGAATT AATCAATTCT -GATCAGAAAA ACTCATCCAG 
ACCC7CACAA 7CTTCG7TAA TTGGTTAfcPA C7AATC7TTT TGAGTAGCTC 




5901 


C&TCAAATGA AACTGCAATT TATTCA7ATC AGGATTATCA ATACCATA7T 
G7AG7TTACT TTGACG77AA ATAAGTATOG ^ : «KJ*£AGT TATGCTA7AA 


- 


<9S1 


7TTGAAAAAG -GCGTT7CTGT AATGAAGGAG AXAACTCACC •GAOGCACCTC 
AAi£i?T7TC GCCAAACACA 77AC*v^<^C f ^r-*«AG*GC <7CCG*CAAG 
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" 01 SSS^TSS CAWATCCT8 GTATCOGTCT GCGATTCCCA CICGICCAAC 
GTA3CCTACC GTTCXAGGAC CATAGCCAGA CGCTAAG6CT GAGCACCTTG 

<0S1 CCTATT *A" tcccctcotc aaaaataacg ttatcaactc 

oaxTMim AC6<SGAGCM mnMTa aatacttcac 

<1M "CT^^eTfir JiSwiSliS ACTCAATCCG GTGAGAATCG CAAAAGCTXA 
rCKCACTGG TACTCACTCC TGACTTAGGC CACTCTTACC GTTTTCGAAT 

CXS1 reCATTTCW KCAGACTtC WCAACACGC CACCCATTAC GCTCGICASe* 
ACCTAAAQAA AOOTCTCAAC AAGTTCTCCC GTCGGTAATG CCACcXg 

W01 UMaaCK eCATCAACCA AACCGTTATT CATTCOTOAT TCCCCCWAG 
T3TTAGTGAG CGTAGTTCGT TTGGCAATM CTAAGCACTA ACGCGGACTC 

PVUX 

" 51 'iSS 06 *^ CTKCAAAAG GACAATTACA AACAGGAATC 

GCXCTGCTT? ATGCGCTAGC GACAA777TC CTGTTAATGT WCTCCTTAG 

S30X GAATOCAACC OSCGCAOSAA CACTGCCACC CCAXAACAA »AT3«ICAce " 
.... C ™f S7XG CCCCCICCTT GTGACCGTCG CGTAGTTGTT ATAAAAGTGC 

6351 TGAAICAGGA TATTCTTCTA ATACCTGCAA TGCTGTTTrC CCGGGGATCG 
_ ACTTAC7CCT ATAACAASAT TATGGACCT? ACGACAAAAC GGCCCCTAGC 

" 01 ^ATCAGGAG TACGGATAAA* ATGCTTGATC 

C3CACCACTC ATTGGTACGT ACCAGTCCTC ATCCC7ATTT ?ACCAACTAC 

" 51 ^SS^ 5 CATAAATTC CCTCAGCCAG TTTAGTCTGA CCATCTCATC 
CAGCCTWTC CGCA77TAAG GCAGTCGGTC AAATCAGACT CCTAGAGTAG 

€50X T&1AACATCA OTGGCAACCC TACCTTTCCC ATGTTTCAGA AACAACTCTG 
ACATTCTAGT AACCCTTGCG ATGGAAACGG TACAAAGTCT TTGTTGAGAC 

Clal 

6551 CTCCCATAC WOCGATAGA 7TGTCCCAGC 7GATTCCCCG 

. . . . . CAAGGCTATC TTAGCTATCT AACAGCGTGG ACTAACGGGC 

<<M G * xce * !m ATACCCATAI AAATCAGCAT CCATGTOJCA 

TGTAA2AGCG CTCCGCTAAA TATCCCTATA TTTACICCTA GGTACAAOCT 

Xhol 

6S51 ASWAATCCC C6CCTC6AGC AA6ACGTT3C CCCTTGAATA TGCClCAWUk 
.... . T ^!!* 0CC CCG5MC,C8 "CTCCAAAO a^AACwI? 

8701 SSFS? *P*CT«W ATGTAACCAQ ACA6TTTTAT TGTTCAICA? 

cwseoAACA taatqacaaa tacatkgtc tcxaaaaxa acaagtacta 

Draxzx 

WS1 r™rS»T»r Fi5Sf TI ** C ^ *MGTAACAT CAGAGAT7TT CAGACACAAC 
CZA7ATAAAA ATACAACAC8 TTACATTGTA <7TCTC?AAAA CTCTCTCTW 



BNSOOOO: <WO_9932147A1 J. 
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Drain 

€801 CTCCCTTTCC CCCCO CCCC C AT7ATT0AA0 CATTTAICAC GGTTATTOTC 
CACCGAAAGC GGGGGQGGGG TAATAACTTC GTAAATAOTC CCAATAACAC 



6851 7CATGAGCCC ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGG3 
AGTACTCGCC TATGTATAAA C7TACATAAA TCTTTTTATT 96TTTA3CCC 



€901 GT7CCCCCCA CATTTCCCC C AAAACTCCCA CCTGACGTCT AAGAAACCAT 
CAA6GCOCGT CT A AACCCQC TTTTCACOOT GGACTGCACA OTCTTTOGTA 



6S51 TATTATCATG ACATXAACCT ATAAAAATAO GCGTATCACC ACCCC CT TT C 
ATAATAGTAC TCTAA7TCGA TA77TTTATC CCCATACTGC 7CCG6GAAAO 



7001 ore 

CAC 
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General Description D 

OKA pVR 1012-GP<5) 
Local object 

Creatcdt 09/14/98 03:58PM 
Last Kodif icatioa Date: ? <no data) 
length i 7073 bp 
storage typei Baaio 
form: Circular 
Cc&ne&ta 

Restriction Map 
Ball:lsHe 
Bdinstte 
Clainsite 

Dram: 1 site g£™» 
Hindlll: 1 site 
Kas!« 1 site ccgccc 
Kpnl:1site ^tg! 
Narl:1sHe gggg 
Pmll: 1 site |*gg£ 
Pvul: 1 site 
Sacll: 1 site §§ocgg 
Sail: 1 site g«*g 
Xbainsite 

Ymnh 1 she GXANKKNTTC 

Ndel:2sites 
EcoRV: 3 sites 
Sphl:3sites g£ 
Ncol: 4 sites <gg<* 
BamHI: 6 sites gg*gj 

Functional Map 

CDS (4 signals) 
C^VIEffUT 
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Start: 836 Ends 1129 
CMV IE INT 

Start: 1130 Ends 1640 
TbGH 

Start: 4090 End: 4642 

Kanr 

Start: 6138 Ends 6760 (Ccmpl arrant a ry) 

Miscjeature (2 signals) 
CMV enhancer 

Start: 248 Ends 885 
GP(S) 

Start: 1370 End: 4089 
Annotations 
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1 TCCCGCCTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCC 
A3CGCGCAAA GCCXCTACTG CCACTTTTGG AGACTGTGTA CGTCGAOGGC 



51 GAGACGGCCA CA6CTTCTCT GTAAGCGOAT GCCGCGAGCA GACAACCCCG 
CTCTGCCAGT GTCOAACACA CATTCGCCTA CGGCCCTCCT CTGTTCGGGC 



101 TCAGGGCGCG TCAGCGGGTG TTCCCGGGTG TCCGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC CAATTGATAC 



Ndel 

151 CGGCATCASA CCAGATTCTA CTGAGACTGC ACCATATCCO GTGTGAAATA 
CCCGTAGTCT CGTCTAACAT GACTCTCACG TGGTATACGC CACACTTIAT 



Ball 

201 CCCCACAGAT GCGTAAGCAG AAAA7ACCGC ATCACATTGG CTATTGGCCA 
GCCGTGTCTA CGCATTCCTC TTTTATGGCG TAGTCTAACC GA7AACCGGT 



251 TTGCATACGT TCTATCCATA TCATAATATC TACATTTATA TTGGCTCATG 
AACGTATGCA ACATAGGTAT AGTATTATAC ATG7AAATAT AACCGAGTAC 



301 7CCAACATTA CCGCCATGTT GACATTGATT ATTGACTACT TATTAATACT 
AGGTTGTAAT GCCGGTACAA CTGTAACTAA TAACTCATCA ATAATTATCA 



251 AATCAATTAC GGGGTCATTA GT7CA7AGCC CATATATGGA GTTCCGCGTT 
TTAGTTAATG CCCCACTAAT CAAC7A7CGG CTATATACCT CAAGGCGCAA 



401 ACATAACTTA CGGTAAATGG CCCGCCTGGC 7GACCGCCCA ACCACCCCCG 
7CTATTGAAT GCCATTTACC GGGCCGACCG ACTGGCGGGT TCCTGGGGGC 



451 CCCATTuACG TCAATAATGA CGTATGTTCC CATAG7AACG CCAATAGGGA 
GGGXAACTGC AGTTATTACT GCA7ACAAGG G7ATCATTGC GGTTATCCCT 



501 CTTTCCATTC ACGTCAATGG GTGGAGTATT TACGG7AAAC TGCCCACTTG 
GAAAGCTAAC TGCAGiVACC CACCTCAT* A ATGCCATTTG ACGGCTCAAC 



Ndel 



551 GCAGTACATC AAGTGTATCA TATGCCAACT ACGCCCCCTA TTGACGTCAA 
CGTCATGTAG 27CACATACT ATACGGTTCA TGCGGCGGAT AACTCCAGTT 



601 TCACGGTAAA TGGCCCGCCT GGCATTATGC CCACTACATG ACCTTATGCC 
ACTGCC^TT ACCGGGCGGA CCGTAATACC -GGTCATCTAC TGGAATACCC 



NCOl 

£51 ACTTTCC7AC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 
TGAAACCATG AACCGTCATC TAGATGCATA ATCAGTAGCG ATAATGCTAC 



. NCOX 

701 GTGATGCGGT TTTGGCAGTA CATCAATGCC CG7GGATACC GGTTTCACTC 
CACTACGCCA AAACCGTCAT GTAG1TACCC GCACCTATCG CCAAACTGAG 



751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTCTTTT 
TGCCCCTAAA CGTTCAGACG TGGGGTAACT GCACTTACCC TCAAACAAAA 
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601 


GGCACCAAAA TCAACGGCAC TTTCCAAAAT -GTCGTAACAA CTCCGCCOCX 
CCGTGG7TTT AGTTGCCCTG AAAGGTTTTA CAGCATTCTT GAGGCCCCGT 




851 


TTGACGCAAA TCGGCGGTAG COGTGTACGC TOGGAGGTCT ATA7AAGCAG 
AACTGCGTTT ACCCGCCATC CGCACATGCC ACGC2CCAGA TA7A7TCGTC 




SOI 


AGCTCCTTTA CTGAACCGTC AGATCGCCTC CAGACGCCAT CCACGCTCTT 
7CGA0CAAAT CACTTGGCAG 7CTAGCCGAC CTCTGCGGTA ■GGTGCGACAA 






sacll 




951 


TTGACCTCCA TAQAAGACAC CGGGACCCAT GGAGCCTCCG -CGGCCCGOAA 
AACTGGAGGT ATCTTCTCTC GCCCTGGCTA *GGTCGGAGGC GCCGGCCCTT 




1001 


CGGTGCATTG GAACGCGGAT ?CCCCGTGCC AAGAGTGACG TAAGTAGOGC 
GCCACGTAAC CC7GCGCCTA AGGGGCACCG TTCTCACTGC ATTCATOGCG 






SphI 




1051 


CTATAGACTC TA7AGGCACA CCCCTTTGOC TCTTATGCAT -GCTATAC9CT 
GATATCTGAG ATATCCGTGT GGGGAAACCG AGAATACGTA CGA7ATGACA 




1101 


rrcTGGcrrc gggcctatac acccccgctt ccttatccta tasgtcatgg 

AAAACCGAAC CCCGGATATG TGGGGGCGAA GGAATACGAT ATCCACTACC 




1151 


TATAGCTTAG CC7A7AGCTG 7GGG7TATTG ACCATTATTG ACCACTOGCC 
ATATCGAATC GGATATCCAC ACCCAATAAC TGGTAATAAC TGGTGAGGGG 




1201 


7ATTCGTGAC GATACTTTCC ATTACTAATC CATAACA7CC CTCTTTOCCA 
ATAACCACTC CTATGAAAGG TAATGATTAG GTA7TCTACC CACAAAGGGT 




12S1 


CAACTATCTC TATTGGCTAT ATGCCAA7AC TCTG1CCTTC AGAGACTGAC 
GTTGA7AGAG ATAACCSATA TACGCTTATG AGACAGGAAG TC1C7GACTG 


* * ••••••••••• 


1301 


ACGGACTCTG TATTTTTACA -GGATGGCGTC -CCA£V*A*TA TTTACAAATT 
TGCC7CACAC A7AAAAA7GT CCTACCCCkG GGTAAATAAT AAATGlTTAA 




1251 


CACATATACA ACAACGCCGT CCCOCCTGCC CGCAGTTT7? ATTAAACATA 
GTGTATATGT TG7TSCGGCA CCGGGCACGG <#CGTCAAAAA 7AATTTGTAT 




1401 


CCGTGGCATC TCCACGCGAA TCTCCGGTAC -CTGTTCCGGA CATOGtf iTCT 
CGCACCCTAG ACGTGCQCTT AQAGCCCXTC CACAAOGCCT GTACCGCAGA 




1451 


TCTCCGGTAG CGGCGGAGCT TCCACATOfcC AGCCCTGCTC CCA7GCCTGC 
AGAGGCCATC GCCGOCTCGA AGGTGTACQC TCGGGACCAG GGTACuG&GG 




1501 


AGCGGCTCAT GGTCGCTCGG CAGCTGCTTG -CTGCTAACAG TCGAGGGCftG 
TCGCCGAGTA CCAGCGAGCC -GTCGAGGA&C -GAGGATtCTC ACCTGCCOTC 


■ • ? 


1551 


AC77AGGCAC AGCACAATCC -CCAOCACCAC CAGTCTGCCG CACAAGGCCG 
TGAATCCGTG TCGTCTTACG GCTGGTGGTG "GTCACAOCCC GTCTTCCCGC 




1601 


TGGCCGTAGG CTATGTGTCT GAAAATGAGC GTGGAGATTG -GGCTGGCACG 
ACCGCCATCC CATACACACA CTTTTACTCG CACCTCTAAC GCGAGCGTGC 




-■651 


GC7GACGTAG ATGGAAGACT TAAGCCAGOG <3CAGA*VGAAG ATOCfcGGCAG 
TAC1 * sCTGA ATTCCGlOiC CGTCT^r^jfSC TATOTtfOCTC 
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1701 C7GAGTTCTT GTAKCTGAT AACAGTCACA CGTAACTCCC GTTGCCCTGC 
GACTCAACAA CATAAGACTA TTCTCACTCT CCATTGAGGG CAACGCCACG 



1751 TCTTAACGG7 GGAGGGCAGT G7ACTCTGAG CAGTACTCGT TGCTGCCCCC 
ACAA77GCCA CCTCCCGTCA CATCAGACTC G7CATGACCA ACCACGGCGC 



ScoX 

1801 CGCCCCACCA CACATAATAC CTGACAGACT AACACAC7GT TCCTTTCCA? 
GCCCGGTGGT C7GTATTATC GACTGTCTGA TTGTCTGACA AGGAAAGCTA 



Sail 

NCOI Poll Bell ECORV 



1851 CCGTCTTTTC TGCAGTCACC G7CG7CGACA CGTGTGATCA GATATCCCGQ 
CCCACAAAAG ACCTCAGTGG CAGCAGC7CT GCAGACTAGT C7ATAGCGCC 



Splil EC03V 



1901 CC<5CTCTAGC TACATGCATG C7CGAGCGGC CGCCAGTCTG ATGGATATCT 
GGCGAGATCG ATCTACGTAC GAGCTCGCCG GCGGTCACAC TACCTATAGA 



Ncol 



1951 GCAGAAT7CT ATCTTCAGGA TCTCGCCATC GAGGGTCTTA GCC7ACTCCA 
CGTCTTAAGA TACAAGTCCT AGAGCGCTAC CTCCCAGAAT CGCATGAGGT 



2001 ATTGCCCAGA GATAAA777C GAAAAA-GCTC TTTCTTTCTT TGGGTCATCA 
TAACGGCTCT C7ATT7AAAG C7777TCGAG AAAGAAACAA ACCCAGTAGT 



2051 TCT7AT7TCA AAAGGCCTTT TCCATGCCT? TCGGTGTTCT GACCAACAGC 
AGAA7AAAG7 T7TCCGGAAA AGGTACGGAA ACCCACAACA CTGGT7G7CG 



2101 AC777AGAAG TAACAGAGAT 7GACCAGCTA GTCTGCAAGG ATCA7CTTGC 
TGAAATCTTC ATTGTCTCTA ACTCC7CGAT CACACGTTCC 7AGTAGAACG 



2151 ATCAACTGAC CAGC7CAAAT CACTTGGTC7 CAACCTCGAG GGGAGCCGAG 
7AGTTGACTG GTCCACTTTA CTCAACCAGA G7TGGAGCTC CCCTCGCC7C 



ScoRV 

2201 TATCTACTGA TATCCCA7C7 GCGACAAAGC G7TGCGGCT7 CAGATCTGGT 
ATAGATGAC7 ATAGGCTAGA CGC7G7TTCG CAACCCCCAA G7CTAGACCA 



2251 G7GCCTCCCC AAGTGCTCAG CTATCAAGCA GSAGAATGGC CTGAAAATTG 
CACGGACGGG TTCACCAGTC GATACT7CCT CC7CTTACCC GACTTTTAAC 



2301 CYACAATCTT GAAA7AAAGA AACCCGACGG GACCGAATGC TTACCCOCAC 
{JATCTTAGAA CTTTATTTCT TTGGCCICCC C7CGCTTACG AATGCCGGTG 



23S1 CGCCGGATGG TGTCACAGGC 7TTCCAAGCT CCCGCTATCT TCACAAAGCC 
GCGGCC7ACC ACACTCTCCG AAAGGTTCCA CGGCGATACA AGTGTTTCGG 



2401 CAAGCAAGCG GGCCCTGCCC CGG7GACTA7 GCCTTTCACA AGGA7GGAGC 
•GTTCCTTGGC CCGGGACGGG CCCACTGATA CGGAAAGTGT TCCTACCTCC 



2451 TTOTTCCTC 7ATGACACCC TGGC7TCAAC TGTAA77TAC ACAGGAGTCA 
AAAGAAGCAG ATACTGTCCG ACCGAAG7TG ACATTAAA7C TCTCCTCAGT 
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2501 ATT7CGCTGA CCGGOTAATC GCATTCTTGA TATTGGCTAA ACCAAAGGAA 
TAAAACGACT CCCCCATTAG CCTAAGAACT ATAACCCATT TO GT T T O CTT 



2551 ACGTTCCTTC AATCACCCCC CATTCGAGAG GCAGCAAACT ACAC7GAAAA 
TGCAAGGAAG TTACTGGGGO CTAAGCTCTC CGTCCTTTGA TCTGACTTTT 



2601 TACATCAAGT rACTATGCCA CATCCTACTT GGACTACGAA ATCGAAAATT 
ATGTAGTTCA ATGATACCGT <5TAGCaTGAA -CCTCATGCCT TAGCTTTTAA 



2651 TTGGTGCTCA ACACTCCACC ACCCTTTTCA AAATTAACAA •TAATACTTTT 
AACCACCAGT TCTCAGGTGC TGGGAAAAGT TTTAATTGTT ATTATG AAAA 



2701 C7TCTTCTCG ACACCCCCCA CACGCCTCAG TTCCTTTTCC AGCTGAATGA 
CAAGAAGACC TCTCCGGGGT GTGCCGAGTC AAGGAAAAGG TCGACTTACT 



2751 TACCATTCAA CTTCACCAAC AGTTGAGCAA CACAACTGGG AAACTAATTT 
ATCGTAAGTT GAACTCOTTG TCAACTCCTT GTGTTGACCC TTTGATTAAA 



2801 GGACACTAGA TGCTAATATC AATGCTGATA T7CGTGAA7C GG CTTTTT GG 
CCTGTGATCT ACGATTATAG TTACGACTAT AACCACTTAC -CCGAAAAACC 



2 851 GAAAA T AAAA AAATCTCTCC GAACAACTAC CTGGAGAACA GCTGTCTTTC 
CTTTTATTTT TTTACAGAGG CTTGTTGATC CACCTCTP C T CGACAGAAAG 



2901 CAAACT77AT CGCTCAACGA GACAGAAGAC GATGATGCCA CATCOTOGAG 
CT7TCAAATA GCGAGTTGCT CTGTCTTCTG CTACTACGCT GTAGCAGCTC 



2S51 AACTACAAAG GCAAGAATCT CGGACCGCCC CACCAGCAAG TATTCGGACC 
TTGATGTTTC C C TT C TT A CA GGCTGCCCCG G7GGTCCTTC ATAAGCCTCG 



3001 TGGTTCCAAA GGATTCCCCT GGGATGGTTT CATTGCACGT ACCACAAGGG 
ACCAAGGTTT CCTAAGGCGA CCCTACCAAA GTAACGTGCA TGGTCTTOCC 



3051 CA*ACAACAT TGG JCTCTCA CAATTCGAC A GAAGGTCCAA GAGVACATGT 
Cr^TCTTGTA ACCCCAGAGT C1XAAGCTGT CTTCC\GCTT CTCATCTACA 



3101 CAATACTCAG GAAACTATCA CACAGACAAC TGCAACAA2C ATACGCACTA 
CTTATGAGTC CTirGATAGT -GTCTCTGTTG ACGTTGTTAG TATCCGTCAT 



3151 ACt-JGTAACAA CATOCAGATC TCCAGCATCG GGACAGGACT CAGCTGCAGC 
TGCCA7TGTT Ci/ACGTCTAG AGGTGCTAGC CCTGTCCTGA CTCGACCTCG 



NCOl 

3201 CAAATCC*GA G^Cf/PCACC AftCCATCQCA CCAAGCGCTC ACACTCAQAC 
GCTTAGGACT Cf^PGAGTCG CV3GTACCGT GGTTCGGGAC TCTCAGTCTG 



3251 CyCCACAACC TACACACCAA AACTACCAGT GATCACCACC CAGGAACCAA 
GAGCTGTTGC ATGTGTCCTT TXhTGGTCh CTACTGSTCG CTCCTMGTT 



3301 C*3£HX?iCe G&yAAACTCT GCTGGCTCAA CAACAG&AGC AGCCACTOXC 
GTTGTCCVGG CXcTTTGACA GGACCCAGTT CTXCTCTTCC TGGGTGACAG 



3351 AJCCACCCCAG ACAATATAAC AACAGCGGTT AAAACTGTTT GGGCACAAGA 
TGCTGGGGTC TCTTATATTO TTC9GGCCAA TTTTGACAAA COCCTOTKT 
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3401 ^Hfii5 C AACGGTCTA* TAACITCAAC AGTAACACM ATTCHCCea. 

™.. T ^ (MAVr ATTCM «™ TCATTCTCCA S££ 

3451 A«»«*«C AAGrTAACAC CACCCc'cACG 

.... /_^. . «SCITTTaCG TCCTCTTCTO TOCAATTGTG C7CCC6GT0C 

35W 2^?** AWCCUWT ACACTACTGG ACTCCACAAG AACUCATM 
.... - !!! "."^"^ ^WOCC rGACGTGTTC mm» 

BaaHi 

3551 I^I^S ^"GCCTGCJA TCCCCTACTt TGCAOCGGGT CCAGAAGGCA 
.....^?ff? TAACCGACCT ACGGCATGAA ACCTGGCCCA CS3CTTCCGT 

3601 Fil^STS ^^CCAAA* ATCCCTTAGT 'cmi^ 

.... "™!T.?f! TCCGGAATAC CTCTTOGTTT TACGGAATCA CACACCTGAC 

3651 i2£*i5 TTS CAAATCAA AC AACTCAAGCT CTCOSciw K^AAsaae 
.... ."7^^ ^*CTT76 WCAGTTCGA CACGTCGAAA 

37 " CTGCCGACAT ATACCATAcV CAaVaGGAAg' COCATACATT 

BaioHl 

3751 ACGATGGCCC GGGACATCTA GGATCCTGOG ACCACAWST 

.... . . ^ TCCTACCCC6 CCCTCTACAT CCTMGACCC TCGCCtSS 

38M ^rr^ CACATCA ™S GACCAAAAAC ATCACTGATA AAATCAACCA 
.... .*f GTAACTC ° ST6TACIMC CTGGTTCTTC TAGTgS TOAgS 

3851 nS ^ ^ '« " " 

. _ . . C / AAAGTAGC ^GGGAAA TGGGTTACTC CTATTACTAC 

EaaKl 

3SW Sf~I'fS5?5 GACGGCCTGG ACACAGTGGA TCCCTGCAGC AATAGGCAT"" 
.... 7 A ^f^ C ^GACC TCTGTCACCT AGGGACGTCC TTATCCGTAA 

.... . . ! AACGTTA CTAACGAGAA GAAACCCAGA CGTTCGACGA 

BaaKI 

<001 ^G-.^AATA TCAGMTTCC AGCACTGCCG GCCGTTACTA GTGGATCcnjk 

.... . A !. AGrcTrAACC MK cggcaItgat cmoSS 

Marl 

B ^ *bal K«*I BaaHi 

sssa ss§ ™ s— 
" u .™ ass is sss ™ 
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<201 GCATCGCA7T CTCTGAGTAG GTG7CATTCT ATECT6GGGG GW3GGGTGGG 
CCTAGCGTAA CAGACTCATC CACAGTAAGA TAAGACCCCC CACCCCACCC 



sphi 



4251 CCAGCACAGC AAGGCGGAGG ATTGGGAAGA CAATAGCAGG CA7GCTGGGG 
CGTCCTGTCG TTCCCCCTCC TAACCCTTCT GTTATCGTCC CTACGACCCC 



Kpnz 



4301 ATGCGGTGGG CTCTATGCGT ACCCAGGTGC TGAAGAAXTG ACGCGGTTOC 
TACGCCACCC GAGATACCCA TCCGTCCACC ACTTCTTAAC TGCGCCAAGG 



4351 TCCTGGGCCA GAAAGAACCA GCCACATCCC CTTCTCSCTG ACACACCCTC 
AGGACCCGCT C T TTCTT C GT CCGTGTAGGG GAAGACACAC TGTGTGGGAC 



4401 TCCACGCCCC TGGTTCTTAG TTCCAGCCCC ACTCATAGGA -CACTCATAGC 
AGGTGCGGGG ACCAAGAATC AAGGTCGGGG TGAGTATCCT GTGAGTATCG 



4451 TCAGGAGGCC TCCGCCTTCA ATCCCACCCC -CTAAAGTAC? TGGAGCGGTC 
AGTCCCCCCG AGGCGGAAGT TAGGGTGCCC GATTTCATGA ACCTCGCCAG 



4501 TCTCCC1CCC TCATCAGCCC ACCAAACCAA AGCTAGGCTC CAXGAGTGGG 
AGACGGAGGG AG7AGTCGGG 7GGTTTGGTT 7GGATCGGAG GTOCTCAOCC 



4551 AAGAAA77AA AGCAAGATAG GCTATTAAGT GCAGAGGGAC AGAAAATGCC 
TTCTTTAATT TCGTTCTATC CGATAATTCA CGTCTCCCTC TCTTTTACGG 



xani 



4601 TCCAACATCT GAGGAAGTAA TCAGAGAAAT CATAGAATTT CTTCCGCTTC 
AGGTTGTACA CTCCTTCATT ACTCTCTTTA GTATCTTAAA <jAAGCCGAAG 



4651 CTCGCTCACT GACTCGCTGC CCTCGGTCCT TCGGCTCCGG -cgaccggtat 

gagcgagtga ctc\gcgacg cgagccagca agccgacgcc gctcgocava 



4701 cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac 
gtcgagtgag tttccgccat tatgccaata gctgtcttac scccctattg 



4751 gcaggaaaga acatgtgagc aaaagcgcag caaaaggcca -ggaaccgtaa 
cgtcctttct tgtacactcg ttttccggtc cttttcoggt -octtggcatt 



4801 aaaggccgcg ttcctggcgt ttttc^atag gctccgcccc cctgaggagc 
tttccggcgc aacg <ccgca aaaac i^mc cgaggc^oog ggacwtcg 



4851 ATCACAAAAA TCGACGCTCA AGTCAGfe?CT GGCGA>J*CCC GACAGOCTA 
TAGTGTTTW AGCTGCCAGT TCACTC1CCA -CCGCT^PSCG CTGTCCT6AT 



4901 TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCC.TGC CCXffGCZ&T 
ATTTCTATGG TCCGCAAAGG GGCACCi^CG AGGOAGCACG CGACAOG^CA 



4951 TCCGACCCTG CCGCSJAOCC GATACCTtfJC CGCWiSCTC -OH^GGGGA* 
AGGCTGGGAC GGCGAATGGC CTATCGACAG CCOCAAAGAC GGAftGGCCTT 



5001 GCGTGGCGCT TTCTCAATOC TCACCCTCTA OGTATCtCAG TKGGTGTftC 
CGCACCCCGA AAGACTTACG ACTGCGACAT CCATAGACTC AAGOCACATC 
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5051 CTCGTTCCCT CCAXOCTGGG CTCTGTGCAC GAACCCCCCG TOCAGCCCGA 
CAGCAAGCGA GCTTCGACCC GACACACGTC CTTGGOGCGC AAGTCCCGCT 
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S101 CCGCTCCCCC TTATCCGG7A ACTATCG7CT TCAOTCCAAC CCCGTAAGAC 
GGCGACGCGG AATAGGCCAT TGATAGCACA ACTCAGGTTG GGCCATTCTC 



5151 ACGACTTATC CCCACTGGCA CCAGCCACTC CTAACAGCAT TAGCAGAGCG 
7CCTGAATAC CGGTGACCGT CGTCGC7GAC CATTCTCCTA A7CCTCTCGC 



5201 AGCTATGTA6 GCG6TGCTAC AGA G TT C TTO AAGTGGTGGC CTAACTACGG 
TCCATACATC CGCCACGATG TCTCAAGAAC TTCACCACCG GATTGATGCC 



5251 CTACAC7AGA AGGACAGTAT TTGGTATCTG CGCTCTCCTG AAGCCAGTTA 
GA7CTGATCT TCCTGTCATA AACCATAGAC GCGAGACGAC TTCGGTCAAT 



5301 CCTTCCGAAA AAfiAGTTGC? AGCTCTTGAT CCGGCAAACA AACCACCGCT 
GGAAGCCTT? TTCTCAACCA TCGACAACTA GGOCCTTTGT TTCGTGGCGA 



5351 GCTAGCGGTC GTTTTTTTGT TTGCAAGCAG CAGATTACGC CCAGAAAAAA 
CCATCGCCAC CAAAAAAACA AACGTTCGTC CTCTAATCCC CGTC gTTTT T 



5401 ACGATCTCAA GAAGA3CCTT TGATCTTTTC TACGGGGTC7 GACGCTCAG7 
TCCTAGAGTT CTTCTAGGAA ACTAGAAAAG ATGCCCCAGA CTGCGAGTCA 



5451 GGAAC GAAAA CTCACGTTAA ©GCATTTTGG TCATGAGATT ATCAAAAAGG 
CCTTGCTTTT GAGTGCAATT CCCTAAAACC AGTACTCTAA 7AGTT7TTCC 



5501 ATCTTCACCT AGATCCTTTT AAA7TAAAAA TCAAGTTTTA AATCAATCTA 
TAGAACTGGA TCTAGGAAAA TTTAATTTTT ACTTCAAAAT TTAGTTAGAT 



5551 AAGTATATAT GACTAAACT7 GCTCTGACAC TTACCAATGC TTAATCAGTG 
TTCATATATA CTCATTTGAA CCAGAC7CTC AATGCTTACG AATTAGTCAC 



5601 AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTCCCTGA 
TCCGTCGATA CAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT 



5651 CTCCGGGGGG CGGGGGCGCT CAGGTCTGCC TCGTCAAGAA GGTGTTGCTG 
CAGGCCCCCC CCCCCCGCGA CTCCAGACGG AGCACTTCTT CCACAACCAC 



5701 ACTCATACCA GGCCTCAATC GCCCCATCAT CCAGCCAGAA AGTGAGGGAG 
TGAGTATGGT CCCCACTTAG CGGGGTAGTA GGTCGGTCTT TCACTCCCTC 



5751 CCACGGTTGA TGAGAGCTTT CTTGTAGCTG GACCAGTTGG TGATTTTGAA 
GGTGCCAACT ACTCTCGAAA CAACATCCAC CTGCTCAACC ACTAAAACTT 



5801 CSTOTGCTTT GCCACGGAAC GGTCTGCGTT GTCGCGAAGA TCCGTGATCT 
GAAAACCAAA CCGYGCCTT G CCAGACGCAA CAGCCCTTCT ACGCACTAOA 



5851 GAVCCTTCAA C9CAGCAAA& GTTCGATTTA TTCAACAAAG CCGCCCTCCC 
CTAGGAAGTT GAGTCCTTTT CAAGCTAAAT AACTTGTTTC GGCGGCftGGC 



5901 G7CAAGTCAC -CGTAATGCTC T-GCCACTGTT ACAACCAATT AACCAATTCT 
CAGTTCAGTC GCATTACGAG ACGGTCACAA TGTTGGTTAA TTGGTTAAGA 



5951 GATTAGAAAA ACTCATCGAG CATCAAATCA AACTGCAATT TATTCATATC 
CTAATCTTTT TGAGTACCTC GTAGTTTACT TTCACGTTAA ATAAGTATAG 
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€001 AGCATTATCA ATACCATA3T TTTGAAAAAG CGGTTTCCQT AATGAAGGAG 
TCCTAATAGT TATOGTATAA AAA CTTTT T C GCCAAAOACA TTACXTCCTC 



€051 AAAACTCACC CAGOCAGTTC CATACCATOC CAAGATCCTO GTATCGSTCT 
TTTTGAGTGC CTCCGTCAAG CTATCCTACC GTOCTAGCAC CATAGCCAGA 



€101 GCGAWCCGA CTCGTCCAAC ATCAATACAA -CCTATTAATT TCCOCTOCTC 
CGCTAAGGCT GAGCACCTTG TAGTTATGTT CGATAATTAA AGGGGACCAG 



6151 AAAAATAAGG TTATCAAGTG AGAAATCACC ATCAGTGACG ACTGAA5CCG 
TTTTTATTCC AATAGTTCAC SCTTTAGTOG TACTCACTOC TGACTTAGGC 



Hlndlll 

€201 GTGAGAATGG CAAAAGCTTA TGCATTTCTT TCCAGACTTG OTCAACAGOC 
CACTCTTACC GTTTTCCAAT ACGTAAAGAA AGGTCTCAAC AAG7TGTOCG 



€251 CAGCCATTAC GCTCGTCA2C AAAATCACTC GCATCAACCA AACCGTTATT 
GTCGCTAATG CGAGCACTAG TTTCAGTGAG CCTAGTTGC7 TTGGCAATAA 



PvuX 

6301 CATTCGTOAT TGCGCCTGAC -CGAGACCAAA TACGCGATCG CTCTT AAAAG 
CTAAGCACCA ACGCGGACTC GCTCTGCTTT ATGCGCTAGC GACAAT7TTC 



€351 GACAATTACA AACAGGAATC GAATGCAACC GGCGCAGGAA CACTGCCAGC 
CTGTTAATGT TTG7CCTCAG CT7ACGTTGG CCGCGTCCTT G7GACGG1CG 



€4 01 GCATCAACAA TATTTTCACC TGAATCAGGA TATSCTTCTA ATACCTGGAA 
CCTAGTTGTT ATAAAAGTGG ACCTAGTCCT ATAAGAAGAT TATGGAGCT? 



€451 TGCTCTTTTC GCCGGGATCC -CAGTGGTGAG TAACCATGCA TCATCAGGAG 
ACGACAAAAG GGCCCCTAGC GTCACCACTC A7TGCTACGT ACTAGTGCTC 



€501 TACGGATAAA ATCCTTGATG GTCGGAAGAG <3CATAAAT?C -CGTCACGCAG 
ATCCCTATTT TACGAACTAC CAGCCTTCTC CCTATTXTAAG GCAGTCG^TC 



€551 TTTACTCTGA CCATCTCATC TGTAACATCA TTGGCAACGC TACCTTTGCC 
AAATCAGACT CGTAGAG7AC ACATTGTAGT AACCCTTGCG ATGCAAAGGG 



Clel 



€601 ATG5TTCAGA AACAACTCTC GCGCATCGGG CWCCCATAC AATCGATA» 
TACAAAC7CT TTGVTGAGAC CGCGTAOCCC X3AAGGGTATG TFAGCTA < 5>CT 



€651 C7GTCGCACC TCATTGCCCG ACATTATCCC GJkGCCCATTT ATAGCCVvAY 
AACAGCCTGG ACTAACGGOC TGTAATAGCG CTCGGGTAAA TATCGCT&Vi* 



6701 AAATCAGCAT -CCATGTTGGA ATTTAATCOC GGCCTCGAGC AAGACGlii^C 
TTTAGTCGTA GGTACAACCT TAAATTAOCG -COGGAGCTCG TTCTGCAKAS 



€751 CCCTTGAATA DGGCTCATAA •CAOCCSCTTOi' ATTACTGTTT ATCTAA^sS 
GGCARCTTA!? ACCGAOEA2T CTGQGGAACA TAATOACAAA TACAWCOTC 



€801 ACAOTTTTAT TGTTCATGAT GATATASTTT TATCTTGTGC AATG7AACAT 
TGTCAAAATA ACAAGTACTA CTATATAAAA ATAGAACACG TTACATTGTA 
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Drain 

6851 CAGAGATTTT GAGACACAAC CTGQCTTTCC CCCCCCCCCC ATTATTGAAG 
CTCTCTAAAA CTCTGTGTTC CACCCAAAGG GCCGGGGGGG TAATAACTTC 



€901 CATTTATCAG GGTTATTGTC TCA5GAGCGC ATACATATTT GAATCTATTT 
GTAAATAGTC CCAATAACAC AQTACTCGCC TATGTATAAA CTTACATAAA 



6951 AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA 
TCITTTTATT TGTTTATCCC CAAGGCGCGT GTAAAGGGGC TTTTCACCCT 



7001 CCTCACOTCT AAGAAACCAT TATTATCATC ACATTAACCT ATAAAAATAG 
GGACTGCAGA TTCTTTGGTA ATAATAGTAC TGTAATTGGA TATTTTTATC 



7051 GCGTATCACG AGGCCCTTTC CTC 
CGCATAGTGC TCCGGGAAAG CAG 
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pVR 1012 -CP (Z) 

General Description 



DNA pVR 1012 -CP <Z) 1) 
Local object 

Creatadi 09/15/98 05:06PM 
Last Modification Date: ? (no data) 
length: 7285 bp 
storage type: Basic 
form; Circular 
•Ccsontenta 



GTCKKMCAC 
7TCGXX 




GATR.TC 
CTXTAG 



GCATGC 



Functional Map 

COS (4 signals) 

CMV 1E 5* UT 

Start: 886 Bad: 1129 
CMV IE INT 

Start: 1130 2nd* 1840 
TbGH 

Start: 430? End: 4854 
Kanr 

Start: €350 zpCi €972 (Conp&flRuaitary) 
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Miscjeature (2 signals) 

CMV enhancer 

Startx 248 2nd: 883 

GP(Z) 

Start* 1870 End: 4301 

Annotations 
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1 


TCGCCCGTTT CGGTGATGAC GGTGAAAACC TCTCACACAT GCAGC7GGCG 
ACCGCCCAAA GCCACTACTG CCACTTTTGG AGACTGTGTA COXCOAOGGC 




51 


GAGACGGTCA CAGCTTCTCT GTAACCGGAT CCCCGGAGCA GACAAGGCCO 
CTCTOCCAGT GTCGAACAGA CATTCGCC7A CGGCCCTCGT CTCTTQGGQC 




101 


TCAGGGCGCC 7CAGCCGGTG TTCOCGGGTG 3CGGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCOCCCAC AGCCCCCACC ^GAATTGATAC 










151 


CGCCATCACA GCAGATTG7A CTGAGAGTGC AOCATXTGCC GTGTGAAATA 
GCCGTACTCT CCTCTAACAT GACTCYCACG 7GG7ATACGC CACACTT7AT 




201 


CCGCACAGAT GCGTAAGGAG AAAATACCGC ATGAGATTGG CTATTGGCCA 
GGCGTCTCTA CCCATTCCTC TTTTATOCCG TAGTCTAACC -GA7AACOGGT 




251 


TTCCATACGT TGTATCCA7A 7CATAATATC TACATTTATA TEGGCTCATG 
AACGTATCCA ACATAGGTAT AGTATTATAC ATGTAAATAT AACCGAGTAC 




301 


TCCAACA77A CCGCCATGTT CACATTGATT ATTGACTAGT TATTAATAGT 
AGGTTGTAAT GGCGGTACAA CTGTAACTAA TAACTGATCA ATAAT7ATCA 




351 


AATCAATTAC GGGGTCATTA GT7CA7AGCC CATATATCGA CTTCCCCC77 
TTAGTTAATG CCCCAGTAAT CAAC7A7CGG CTATATACCT -CAAGGCGCAA 




401 


ACATAACTTA CCGTAAATGG CCCGCCTGGC TCACCGCCCA ACGACCGCCG 
7G7ATTGAAT CCCATTTACC GGGCGGACCG ACTGGCGCGT TGCTCGGGGC 




<51 


CCCATTGACG TCAATAATGA CGTATCTTCC CATAGTAACG CCAA7AGGGA 
GGGTAACTGC AGTTATTACT GCATACAAGG GTATCATCGC GCTTATCCCT 




501 


C7TTCCATTG ACGTCAATGG GTCGAGTATT TACGGTAAAC TGCCCACTTC 
CAAAGGTAAC TGTAGTTACC CACCTCATAA ATOCCATT7S ACGGGTCAAC 






Ndei 




551 


Gt^GTACA7C AAGTGTATCA TATGCCAAGT AGGCCCCCTA 77GACGTCAA 
CG^CATGTAG TTCACATAGT ATACGGTTCA TCCGGCGGA7 AACTGCAGTi* 




«01 


TGaCCGTAAA TCGCGCGCCT GCCA77ATGC OCAGTACATG ACCTTATGCG 
AC^GCCATT? ACCGCGCGGA CCCTAATACG GGTCATG7AC TCGAATAOCC 






Ncol 




£51 


ACCTTCCTAC TTGGCAG7AC ATCTACG7AT TAGTCATCGC TATTACCAM 
TGAAAGGATG AACCGTCATG TAGATGCATA AXCAGTAGGG ATAATCCTAC 










701 


GuCATGCCGT 77TGGCAGTA CATCAATGGG CG7GGATASC ^GTTTCACXC 
CACTACGCCA AAACCGTCAT -GTAG7TACCC OCftCCTASOG -CCAAACTOAG 




751 


ACCCCGA7TT CCAAGTCTCC AGCCCATTGA CGTCKATCGG AGTTOGySTT 
TCCCCC7 AAA GGTXCAGAGG TCGGGTAACT GCAGTTAOOC TCAAACAAAA 




801 


G^CACCAAAA TC/.ACGGGAC TTTGCAiAA* <5TCG?A. 7 ^?.^ . 
CCG7CCTTTT AC^^CCCTC AAAGCWviA CAGCkTTGI'a CAGGCOOOC*i' 
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t%l TTGACC CAAA TOGGCGOTAG GCC7GTACGG TCGGAGGTC* ATATAAGCAC 
AAC7CCGTTT ACCCGCCATC CGCACATGCC ACCCTCCAGA TATATTCGTC 

901 AGCTCCTTTA CTCAACCCTC AGATCGCCTO GAGACGCCAT CCACCCTOTT 
TCGACCAAAT CACTTGGCAG TCTAGCCGAC CTCTCCOGTA CCTGCGACAA 



sacix 



951 TTGACCTCCA 7A6AAGACAC CCGGACCGAT CCAfiCCTCCO CGGCCCGGAA 
AACTGGAGGT ATCTTCTGTG GCCCTGGCTA GGTCGGAGGC GCCCGCCCTT 



1001 CCGTGCATTG GAACGCOGAX 7CCCCGTGCC AAGAG7C&CG TAAGTACCGC 
GCCACCTAAC CTTGCGCCTA AGGGGCACCG TTCTCACTGC ATTCATGGCG 



SphI 



1051 CTATAGACTC TATAGGCACA CCCC77TCGC TCTTATGCAT GCTATACTGT 
GA2A7C7GAG ATATCCCTGT GGCGAAACCG ACAATACGTA COATATGACA 



1101 TTTTCGCTTG CGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGIGATCC 
AAAACCGAAC CCCGGA7ATG TGGGCGCGAA CGAATACGAT ATCCACTACC 



1151 7ATAGCTTAG CCTATAGGTQ TGGGTTATTG ACCATTATTC ACCACTCCCC 
ATATCGAATC GGATATCCAC ACCCAATAAC TGG7AATAAC TGGTGACGGG 

1201 TATTGGTCAC GATACTTTCC AYTACTAA7C CATAACATGG C1CTTTGCCA 
ATMCCACTG CTATGAAAGC TAA7GATTAG GTATTGTACC GAGAAACGCT 

12S1 CAACTATCTC TATTGGCTAT ATGGCAATAC TCTGTCCTTC ACAGACTGAC 
GTTGATAGAG ATAACCGA7A TACGGTTATG AGACAGGAAG TCTC7GACTC 



1301 ACGGACTCTC TATTTTTACA GCATGGGGTC CCATTTATTA TTTACAAATT 
7GCC7GAGAC ATAAAAATGT CC7ACCCCAG GG7AAATAAT AAATGTTTAA 

1351 CACATATACA ACAACGCCC? CCCCCCTGCC CGCAGT7TTT ATTAAACATA 
<5TCTATATGT TGTTGCCGCA GGGGGCACGG CCG7CAAAAA TAATTTGTAT 



K01 GCGTGGGATC TCCACGCGAA TCTCGGCTAC GTGTTCCGGA CATGGGCTCT 
CGCACCCTAG AGGTGCGCTT AGAGCCCATG CACAAGGCCT GTACCCGAGA 



1<51 7C7CCGCTAG CCGCGGAGCT TCCACATCCG AGCCCTGGTC CCATGCCTCC 
AGAGGCCATC GCCGCC7CGA AGCTttTAGGC TCGGGACCAG GG7ACGGAG6 

1501 AGCGCCTCAT ■CCTOGCTCGC CAGCTCCTTG CTCCTAACAC TGGAGCCCAG 
TCCCCGAGTA <2CAGCGAGCC CTCGAGGAAC GMH5ATTCTC ACCTCCGGTC 

1551 AC7TAGGCAC AGCACAATGC CCACCACCAC CAGTGTGCCC CACAAGGCCG 
TGAATCCGTG rCGTCTTACG CCTGGTGGTG GTCACACCGC GTGT7CCGGC 

1601 TGGCGGTAGG GTATGTOICT GAAAATGAGC G^GGAGAT7G WtCGCACG 
ACCCCCATCC CATACACAGA CTTTTACTCG CACCIttf AAC CCGAGCCTCC 

1*51 -GCTCACGCAC ATGGAAGACT TAAGGCAGCG GCAGAAGAAC ATGCAGGCAG 
CGACTGCCTC TACCTTCTGA ATTCCGTCCC CG7CT7STTC TACGTCCGTC 

1701 CTGAG7TC7T -GTATTCTGAT AACAGTCAGA GGTAACTCCC GTTGCGGTCC 
GAC7CAACAA CATAAGACTA TTCTCAC7CT CCAYTCAGGG CAACCCCACC 
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HpaZ 

2751 TGTTAACGGT CGAGGGCAGT CTAGTCTGAG CAOTACTCGT TGCTGCCGCC 
ACAATTGCCA CCTCCCGTCA CATCAGACTC GTCATOAGCA ACCACGGCGC 

Ncor 

18 OX CGCGCCACCA GACATAATAG CTGACAGACT AACAGACTCT TCCTTTCCAT 
GCCCGGTGGT CTGTATTATC GACTGTCTCA TTGTCTGACA AGGAAAGCTA 



NCOl Fmll EcoKVNOtl 



1851 CS GTCTTTTC TGCAGTCACC GTCCTCGACA CCTGTGATCA CATATGGCGG 
CCCACAAAAG ACGTCAGTGG CAGCAGCTGT CCACACTXCT CTATAGCOCC 



KotI XbQZ KftfiX 



1901 CCGCTCTAGA CCAGGGGCCT GGATCGATCC GCGATGAAGA TTAACCOGAC 
GGCCAGATCT GG3CCGCCGA CCTAGCTAGG CGCTACTTCT AATTCGGCTG 



XSSI AGTGAGCGTA ATCTTCATCT CTCTTAGATT ATTTCTTTTC CACAGTAGGG 
TCACTCGCAT TAGAAGTAGA GAGAATCTAA TAAACAAAAG C7CTCATCCC 



2001 GTCGTCAGGT -CCTTTTCAAT CGTGTAACCA AAATAAACTC CACTAGAAGG 
CAGCAGTCCA CGAAAAGTTA GCACATTGGT TT7ATTTGAG C3CATCTTCC 



2051 ATATTGTGGG GCAACAACAC AATGGGCGTT ACACGAATAT TGCAGTCACC 
TATAACACCC CGTTGTTGTG T7ACCCGCAA TGTCCTTATA ACGTCAATGG 



2101 TCCTGA7CGA TTCAAGAGGA CATCATTCTT nCTTTGGGTA ATTATCCTTT 
AGCAC7AGCT AACTTCTCCT GTAGTAACAA AGAAAOCCAT TAATAGGAAA 



2151 TCCAAACAAC AT^^CCATC CCACTTCGAG TCATCCACAA TAGCACA2TTA 
AGGTTTCTTG TAAAAGGTAG GGTCAACCTC ACTAGGTGTT ATCGTGTAAT 



2201 CACCTTAGTC ATGTCGACAA ACTAOTTTGT CCTGACAAAC TGTCATCCAC 
CTCCAATCAC TACAGCTGTT TGATCAAACA GCA C T ST TT G ACAGTAGGTG 



2251 AAATCAATTG AGATCAGTTC GACTGAATCT CGAACCGAAT GGAGTGGCAA 
TTTACTTAAC TCTAGTGAAC CTGACTTAGA CCTTCCCTTA CCTCACCClT 



2301 CTGACGTGCC ATCTGTJ^JCT AAAACATGGG GCTTCAGGTC CGGTG1CCCA 
GACTGCACGG TAC-U^TTGA TT72CTACCC CGAACTCCAG GCCACAGGGT 



2351 CCAAAGGTGG TCAJ.TTATGA ACCTGGTGAA TGGGCTCAAA ACTCCTACAA 
GGTTTCCACC ACflTAATACT TCGACCACTT ACCCGACTTT TGACGATGTT 



2401 TCTTGAAATC AAAAAACCTG ACGCGAGTGA GTCTCTACCA CCAGCGOCAG 
AOAACTTTAG TTTTTTGGAC TCCOCTCACT CACAGA9CGT CGTCGCCGTC 



2651 ACGGGA7TCG GGGCTTCCCC CGGTOCCCGT ATCTGCACAA ACTATCAGGA 
TGCCCTAAGC CCCGMCGGG GCCACGGOCA TACACGTGTT 3CATAGICCT 



2S01 ACGGGACCGT GT CCCC GAGA CTTTGOCTTC CATAAAGAGG GTGCTTTCTT 
TGCCCTGGCA CZCWYTVC? GAAACGCAAG GTATT3CTCC -CAOGAAAGAA 
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2551 CCTGTATGAT CGACTTCCTT CCACAGTTAT CTACCGAGGA ACGACTTTCC 
GGACATACTA GCTGAACGAA CCTOTCAATA GATGGCTCCT TGCTCAAAGC 



2601 CTGAAGGTGT CGTTGCATTT CTGATACTGC CCCAAGCTAA GAAGGACTTC 
GACTTCCACA GCAACGTAAA GACTATGACG GGG7TCGATT CTTCCTCAAG 



2651 TTCAGCTCAC ACCCCTTGAG AGAGCCGGTC AATGCAACGG AGGACCCCIC 
AAG7CGAGTG TGGGGAACTC TCTCGGCCAQ TTACGTTGCC TCCTCCGCAG 



ECoRV 



2701 TAGTGGCTAC TATTCTACCA CAATTAGATA TCAGGCTACC GGTTTTGCAA 
ATCACCGATG ATAACATGGT CTTAATCTAT AGTCCGATCG CCAAAACCTT 



2751 CCAATGAGAC AGAGTACTTG TTCGAGGTTG ACAATTTCAC CTACGTCCAA 
GGT7ACTCTG TCTCATGAAC AAGCTCCAAC TGTTAAACTG GATCCAGGTT 



2801 CTTGAASCAA GATTCACACC ACAGTTTCTC CTCCACCTGA ATGACACAAT 
GAACTTAGTT CTAAG7GTCG TGTCAAAGAC GAGCTCGACT TACTCTGTTA 



2851 ATATACAAGT CGGAAAAGGA GCAA7ACCAC GGCAAAACTA ATTTGGAAGG 
TATATGTTCA CCCT7TTCC7 CGTTATGGTG CCCTTT7GAT 7AAACCTTCC 



2901 TCAACCCCGA AATTGATACA ACAATCGGGG AGTGGGCCT? CTGGGAAACT 
AGTTGGGGCT TTAACTATGT TGTTAGCCCC TCACCCGGAA GACCCTTTGA 



2951 AAAAAAAA CC TCACTAGAAA AATTCGCAG7 GAAGAGTTCT CTTTCACAGT 
TTTT7TTTCG AGTGATCTTT TTAAGCCTCA CTTCTCAACA GAAAGTGTCA 



3001 TGTATCAAAC GGAGCCAAAA ACATCAGTCG 7CAGAGTCCG GCGCGAACTT 
ACATACT7TG CCTCGGT1TT TGTAOTCACC AGTCTCAGGC CGCGCT7GAA 



3051 CTTCCCACCC AGGGACCAAC ACAACAACTG AAGACCAGAA AATCATGGCT 
GAAGGCTGGC TCCCTGGTTG TCTTGTTGAC TTCTGGTCTT T7AGTACCGA 



3101 7CAGAAAATT CC7C7CCAA7 GGTTCAAGTG CACAGTCAAG GAACCCAAGC 
AGTCTTTTAA GGACACGTTA CCAAGTTCAC GTG7CAG7TC CTTCCCTTCC 



3151 TOCAGTGTCG CATCTAACAA CCCTTGCCAC AATCTCCACG AGTCCCCAAT 
ACG7CACAGC GTAGATTGTT GGGAACCGTG TTACAGGTGC TCAGGGCTTA 



3201 CCCTCACAAC CAAACCAGCT CCGGACAACA GCACCCATAA TACACCCGTG 
GGCAGTGTTG GTTTGCTCCA GCCC7GTTCT CGTGCCTATT ATGTGGGCAC 



3251 TATAAACT7G ACATCTCTGA CGCAACTCAA GTTCAACAAC A7CACCGCAG 
ATATT7GAAC TGTAGAGACT CCGTTCAGTT CAACTTGTTC TAGTGGCCTC 



3301 AACAGACAAC tSACAGCACAG CCTCCGACAC TCCCTCTGCC ACGACCGCAG 
TtGSCTCTTQ CTCTCGTGTC <3GAGGCPGTG AGGGAGACGG TGCTGGCG3C 



3351 CCGGRCCCCC AAAAGCAGAG AACACCAACA CGAGCAAGAG CACTGACTTC 
GGCCTGGGGG TTTTCCTCTC TTCTGGTTGT CCTCGTTCTC GTGACTGAAG 



3401 C7GGACCCCG CCACCACAAC AAGTCCCCAA AACCACAGCC AGACCGCTGG 
GACCTCGGGC "GGTGGTG7TG TTCAGGGGTT TTGGTGTCGC TCTGGCGACC 



BNSDOCID: <WO 9932147A1_I_> 





WO 99/32147 

29 


PCT/US98/27364 


3451 


CAACAACAAC ACTCATCACC AAGAXACCGG AGAACAGAGT GCCACCAOCO 
GTTGTTCTTG TGACTAOTGG TTCTAffGGCC TCTTCTCTCA CCCTCGTCOC 




3501 


GGAAGCTAGQ CTTAATTACC AATACTATTC CTCOAOTCGC ACGACWSATC 
CCTTCGATCC CAATTAATCG TTATGATAAC -GACCTCACCO TCCTCACTAO 




3551 


ACAGGCGGGA GAAGAACTCG AAGAGAAGCA ATTGTCAATG CTCAAOCCAA 
7GTCCGCCCT CTTCTTGAQC TSCTCTTCGT TAACAGTTAC CAGTTGGCTT 




3601 


ATCCAACCCT AATTTACATT ACTGCACTAC TCACCATOAA GGTCCTGCAA 
TACGTTGGGA TTAAATCTAA TGACCTOATG ACTCCTACTT CCACGACGTT 




3C51 


TCGGACTGCC CTGGATACCA TATT20GGGC CAGCACCCGA -GGGAATTTAC 
ACCCTGACCG GACCTAKSGT ATAAAGCCCG C1CGTCGGCT CCCTTAAATC 




3701 


ATAGAGGCCC TAATGCACAA TCAAGATGGT CTAATCTCTC CGTTGAGACA 
TATCTCOCCG ATCACGTGT? AGTTCTACCA AATTACACAC CCAACTCTGT 




2751 


GC7GGCCAAC GACACGACTC AAGC7CTTCA ACTCTTCCTG AGAGCCACAA 
CGACCGGTTG CTCTGCTGAG 7TCGAGAAGT TGACAAGGAC CCTCGGTCTT 




3801 


C7GAGCTACG CACCTTTTCA ATCCTCAACC CTAAGGCAAT 2GATT7CTTG 
GACTCCATGC GTGGAAAAGT TAGGAGTTC5 CATSCCCTTA ACTAAAGAAC 




3851 


CTGCAGCGAT GGGCCGGCAC ATGCCACATT CTGGGACCGG ACTGC7GTAT 
GACCTCGCTA CCCCGCCGTG tTACGCTGTAA GACCCTGGCC TGACGACATA 




3901 


CGAACCACAT GATTGGACCA ACAACATAAC AGACAAAATT GATCAGATTA 
GCTTGGTGTA CTAACCTGGT TCTTGTATTG TCTGTTTTAA CTACTCTAAT 




3951 


TTCA1-CATTT TGTTGATAAA ACCCTTCCCG ACCAGGCGGA CAATGACAAT 
AAGTACTAAA ACAAC.ATTT TGGGAAGGCC TGCTCCCCCT CTTACTGTTA 




4001 


TGG7GCACAG CATGukGACA ATGGA7ACCG GCAGGTATTG GAGTTACAGG 
ACCACCTGTC CTACCTCTGT TACCTATGGC CCTCCATAAC CTCAATGTCC 




4051 


CG77ATAATT GCAGTTATCG CTTTATTCTG TATA7GCAAA TTTGTCTTTT 
GCAATATTAA CGTCAATAGC GAAATAAGAC A7ATACGTTT AAACAGAAAA 




4101 


ACTTTTTCTX CAGATTGCTT CATCCAAAAG CTCAGCCTCA AATCAATCAA 
TCAAAAACAA GTCTAACGAA CTACCTTTTC GAGTCGGAGT TTACTTACTT 




4151 


ACCAGGATTT AATTATATGG ATTACTTSAA TC7AAGATTA CTTGACAAAT 
TGGTCCTAAA TTAATATACC TAATGAACTT AOATTCTAAT -GAACTCTTTA 




4301 


GATAATXTAA TACACTGGAG CTTTAAACAT AGCCAATGTG A7TCTAACTC 
CTATTATATT ATGTGACCTC GAAA7TTCTA TCGGTTACAC TAAGATTGAG 




4251 


CTTTAAACTC ACAGTTAATC ATAAACAAOG WTGGTACCC AGCTCSGAATT 
GAAATTTGAG TGTCAATTAG TATTTGTTCC AAACCAVGGC TCGAGCTTAA 




4301 


ATCTCCTGTG CCTTCTAGTT GCCAGCCATC TGTTCCTTQC CCCTOCGCCO 
TAGACGACAC GGAAGATCAA CGGTCGGTAG ACAACAAACC GGGA<3GCCCC 




4351 


TGCCTTCCTT GACCCTGGAA GGTG0CAC7C CCACTCTCCT TTCCTAATAA 
ACGGAACGAA CTGGGACCTT CCACGGTGAG ^GT^5%CA0GA AAGGATTACT 
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4401 WCAGGAAA TTGCA2CGCA TTGTCTGAGT AGGTGTCATT C?ATTCTCGC 
TTACTCCTTT AACCTAGCGT AACAGACTCA TCCACAOTAA GATAACACCC 

4 4 SI GGGTGGGGTG GGGCACCACA GCAAGGGGGA GGJOTGGCAA GACAATACCA 
CCCACCCCAC CCCCTCGTGT CGTTCCCCCT CCTAACCCTT CTGTTATCGT 

sphz 

4501 GCCASGCCCG GGMCCCOTO GGC7CTATGG GTACCCACCT GCTGAACAAT 
CCGTACCACC CCTACGCCAC CCGAGATACC CATGGCTCCA CGACTTCTTA 

4551 TOACCCGGTT CCTCCTGCOC CAGAAAGAAO CAGGCACATC CCCTTCTCTG 
ACTCCGCCAA CGAGGACCCC GTCTTTCTTC GTCCGTGTAG GCCAAGAGAC 

4601 TGACACACCC TCTCCACGCC CCTGGTTCTT AGTTCCAGCC CCACTCATAO 
ACTCTGTGGG ACAGGTGCGG GGACCAAGAA TCAAGGTCGG GCTGAGTATC 

4651 GACAC7CATA GCTCAGGACG GCTCCGCCT? CAATCCCACC CCC7AAAGTA 
CTGTGAGTAT CGAGTCCTCC CGACGCGGAA CTTAGGGTGC CCGATTTCAT 

4701 CTTGGAGCCS TCTCTCCCTC CCTCA7CAGC CCACCAAACC AAACCTAGCC 
GAACCTCGCC AGAGAGGGAG CCAGTAGTCC CGTGGTTTGG TTTGGATCGG 

4751 TCCAACAGTG GCAAGAAATT AAAGCAAGAT AGGCTATTAA GTGCACAGGG 
AGGTTCTCAC CCTTCTTTAA TTTCGTTCTA TCCCATAATT CACGTCTCCC 

4801 AGAGAAAATG CCSCCAACAT CTGAGGAACT AATGAGAGAA ATCA7AGAAT ' 
TCTCTTTTAC CGAGGTTGTA CAC3CCTTCA TTACTCTC7T TAGTATCTTA 

4851 TTCTTCCGCT TCCTCGCTCA CTGACTCCC? GCGCTCGGTC CTTCGGCTOC 
AAGAAGGCGA AGGAGCCAGT GACTGAGCGA CCCGAGCCAC CAAGCCGACG 

4901 GGCCAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT ATCCACAGAA 
CCGCTCCCCA TACTCGAGTG AGTTTCCGCC A77ATGCCAA 7AGCTGTCTT 

4SS1 TCAGGGGATA ACGCAGGAAA CAACATGTGA GCAAAACCCC AGCAAAAGGC 
AGTCCCCTAT TCCGTCCTTT CT7GTACACT CGTTTTCCGG TCGTTTTCCG 

£001 CAGGAACCG? AAAAA GGCCC CGTTGCTCGC GTTTTTCCAT ACGCTCCGCC 
GTCCTTGGCA TTTTTCCCGC GCAACGACCG CAAAAAGG7A TCCGACGCGG 

5051 CCCCTGACGA GCATCACAAA AATCGACGCX CAAGTCACAG GTGGCCAAAC 
GGGGACTGCT CCTAGTGTTT TTAGCTCC<5A GTTCAGTCTC CACCGCTTTC 

5101 CCGACAGGAC 7ATAAAGA7A CCAGGCGTTT CCCCCTGGAA GCTCCCTCCT 
. . . . ^^T: ATXTTTCTAT GCTCCCCAAA CGGGGACCTT SagSgACCA 

5151 ??E?? ,CCT GTTCCGACCC TOCCGCITAC CGGATACCTO TCCGCCTTTC 

CGCCAGAGGA CAAGGCWGC acgocgaato gcctatggac acccggaaao 

5201 ISS S5£E£2£ TAGG^lcic 

acggaagccc togcaccgc caaagagtta cgagtgcgac atccatacao 
5251 agttcgctgt aggtggttcg ctccaagctg cgctgtgtgc acgaaccccc 

TCAAGCCACA TCCAGCAACC -GAGGTTCGAC CCGACJOCO TGCTTG^ 
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5301 CGTTCACCCC GACCOCTGCC GCTTATCCGO TAACTAMGT CTTCAGTCCA 
GCAAGTCGGG CTGGCGACGC GGXATAGGCC ATTGATAGCA GAACTCACCT 



5351 ACCCGGTAA6 ACACGACTTA TCGCCACTGG CAGCAOCCAC TGGTAACACO 
rCCGCCATTC TGTGCTGAAT ACCGGTGACC CTCGTCGCTG ACCATTCTCC 



5401 ATTAGCAGAG CGAGCTATGT AGCCGOTOCT ACAGAGTTCT TGAAGTGGTG 
TAATCCTCTC GCTCCAEACA TCCGCCACCA TCTCTCXAGA ACT9CACGAC 



£451 GCCTAJWCTAC GGCTACACTA CAAGGACACT ATTTGCTATC TGCGCTCTGC 
CGGATTCATG CCGATGTGAT •CTTCCTCTCA 7AAACCATAG ACGCGAGACG 



5501 TGAAGCCAGT TACCTTCCGA AAAAGACTTG GTACCTCTTO AVCCGGCAAA 
ACTTCGGTCA ATG6AAGCCT TTTTCTCAAC CATCGAGAAC TAGGGOGTTT 



5551 CAAACCACCG CTGGTAGCCG TGG TTTTTTT GTTTGCAAGC AGCAGATTAC 
CTTTGG7GGC GACCATCQCC ACCAAAAAAA CAAA0G7TCG 9CGTCTAATG 



SS01 CCGCAGAAAA AAACGATCTC AA6AAGATCC TTTGATCTTT SCTAGGGGGT 
CGCCTCTTTT TTTCCTAGAO TTCTTCTAGG AAACTAGAAA AGATGCCGCA 



5*51 CTGACGCTCA CTGGAACGAA AACTCACGTT AAGGGATT7? -GG7CATGAGA 
GACTGCGAGT CACCTTGCTT TTGAGTGCAA TTCCCTAAAA CCACTACTCT 



5701 WATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA AAtGAAGTTT 
AATAGTTTTT CCTAGAAGTC GATCTAGGAA AATTTAATTT TTACTTCAAA 



5751 TAAACCAATC TAAAGTATAT ATGAGTAAAC 7TGG2CTGAC AGTTAOCAAT 
AT7TAGTTAG ATinCATATA TACTCAWTG AACCAGACTG SCAATCGTTA 



5001 GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTCTCTATT TCCTOCATCC 
CGAATTAGTC ACTCCGTGGA TAGAGTCGCT AGACAGATAA AGCAAGTAGG 



5fi51 ATAGTTGCCT CACTCCCGGO GGGCCGOGCG C3GAGCICTG CCTCCTGAAG 
TA'XCAACGGA CTGAGGCCCC CCCCCCCCGC -GACTCCAGAC CGftGCACWC 



5901 AAGGTGTTGC TCACTCATAC CAGGCCTGAA TCCCCCCATC ATCCAGCCAG 
TTCCACAACG ACTGAGTATG GTCCGGftCTT AGCGGGCTAG TAGCTCGGTC 



•5951 AAAG7GAGGG AGCCACGGTT -GATGAGAGCT TT GT T GTA GG TGGACCAGTT 
TTXCACTCCC 1CCGTGCCAA CTACTCTCCA AACAACATCC ACCTGGTCAA 



6001 GGTGATTTTG AACTXTTGCT TTGCCACGCA ACGGTC7GCG TTCTCGGCAA 
CCACTAAAAC TTGAAAACGA AACGGTGGCT TCCCAGACGC AACAGCCCTT 



6051 GATCCGTGAT CTGATCCTOC A&CTCAGCAA AAGTTOGATT TATTCA7LCAA 
CTACGCACTA GACTAGGAAG TTGAGTCGTT TTCAAGCTAA ATAAGTTGTT 



6101 AGCCGCCGTC CCCTCAACTC AGCG7AATGC TCTGCCAGTG TTACAAOCAA 
TCGGCGGCAG GGCAGTTCAG TCGCATTACG AGACGG7CAC AATGCTCGTT 



6151 TTAACCAATT -CTGATTAGAA AAACTCATCG AGCATCAAAT GAAACTCCAA 
AATTGGTTAA GACTAATCTT TTTGAGTAGC TCGTAGTTTA CTrTGAQGTT 



6201 TTTATTCATA TCAGGATTAT CAATAOCATA TTTTTCAAAA AOGCGTTTCT 
AAA7AAGTA? A5TCCTA.\TA OTTATGGTAT AAAAACTTTT TCGGCAAAGA 
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€251 CTAATGAAGG AGAAAACTCA CCGAGGCAGT TCCATAG6AT GGCAAGATCC 
CA7TACTTCC TC TTTTG A6? GGCTCCOTCA AGCTATCCTA CCGTTCTAGC 



6301 TGGTATCGG? CTGCGATTCC GACTCGTCCA ACATCAATAC AACCTATTAA 
ACCA7AGCCA GACGCTAAGG C7GACCAGG7 TGTAGTTATC TTGGATAATT 



€351 TTTCCCCTCC TCAAAAATAA GGTTATCAAG TGAGAAATCA CCATGAGTGA 
AAAGGGGAGC AGTTTTTATT CCAATAGTTC ACTCTTTAGT GGTACTCACT 



Hindlll 



6401 CGACTGAATC CGGTGAGAAT GGCAAAAGCT 7ATGCATTTC TTTCCAGACT 
GC7SACTTAC GCCACTCTTA CCGTTTTCGA ATACGTAAAG AAAGGTCTGA 



6451 TGTTCAACAC GCCAGCCATT ACGCTCGTCA TCAAAATCAC TCGCATCAAC 
ACAAGTTGTC CGGTCGGTAA TGCGAGCACT AGTTTTAGTG AGCGTAGTTG 



PVUI 

6501 CAAACCGTTA TTCATTCGTG ATTGCGCCTG AGCGACACGA AA7ACGCGA7 
GT7TCGCAAT AAGTAAGCAC TAACGCGGAC TCCCTCTGCT TTATGCGCTA 



Pvul 

6551 CCCTGTTAAA AGGACAATTA CAAACAGGAA TCGAATCCAA CCGGCCCAGG 
GCGACAATTT TCCTGTTAAT GTTTCTCCTT AGCTTACGTT GCCCGCGTCC 



€601 AACACTGCCA GCGCATCAAC AATATTTTCA CCTGAATCAG CATA7TCTTC 
TTGTCACGGT CGCGTAG77G TTATAAAAGT CCACTTAGTC CTATAAGAAG 



6651 TAATACCTGG AATGCTGTTT TCCCGGGGAT CGCAGTGGTG AGTAACCATG 
ATTATGGACC TTACGACAAA AGGGCCCCTA GCGTCACCAC TCATTGGTAC 



6701 CATCATCAGG AGTACGGATA AAATGCTTGA TGGTCGGAAG AGGCATAAA7 
GTACTAGTCC TCATGCCTAT TTTACGAACT ACCAGCCTTC TCCGTATTTA 



6751 TCCC7CAGCC AGTTTAGTCT GACCATCTCA 7C7G7AACAT CATTCGCAAC 
ACGCAGTCGG TCAAATCAGA CTGGTACAGT AGACATTGTA GTAACCCTTG 



6801 GCTACCTTTG CCATG7TTCA GAAACAAC7C TGGCGCA7CG GGCTTCCCAT 
CGATGGAAAC GGTACAAAGT CTTTGTTGAG ACCGCGTAGC CCGAAGGGTA 



6651 ACAATCGATA GATTGTCGCA CCTGATTCCC CGACATTATC CCGAGCCCAT 
TGTTAGCTAT CTAACAGCCT GGACTAACGG GCTGTAATAG CGCTCGGGTA 



Xhol 

6901 TTA7ACCCAT ATAAATCAGC ATCCATGTTG GAATTTAATC CCGGCCTCGA 
AATATGGGTA TATTTAGTCG TAGOTACAAC CTTAAATTAG CGCCCGAGCT 



XhDZ 

6951 GCAAGACGTT TCCCG7TGAA TATGGCTCAT AACACCCCTT GTATTACTGT 
CCTTCTGCAA AGGGCAACTT ATACCGAGTA TTGTGGGGAA CATAATGACA 



7001 TTATGTAAGC AGACAGTTT? ATTGTTCATC ATGATATATT TTTATCTTGT 
AATACATTCG TCTGTCAAAA TAACAAGTAC TACTATATAA AAATAGAACA 
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DraZZI 



7051 GCAATGTAAC ATCAGAGATT TTCAGACACA ACCTGOCTTT CCCCCCCCCC 
CCTTACATTG TAGTCTCTAA AA CT CTCTCT TGCACCGAAA CGGGGGGGGG 



7101 CCATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC CGATACATAT 
CGTAATAACT TCGTAAATAC TCCCAATAAC AGAGTACTCC CCTATCTATA 



71S1 TTGAATGTAT TTAGAAAAAT AAACAAATAC GGGTTCCCCC CACATTTCCC 
AACTTACATA AATCTTTTTA TTTGTTTATC -CCCAAGGCGC 'GTGTAAAOGO 



7201 CGAAAAGTGC CACCTOACGT CTAAGAAACC ATTATTATCA TCACATTAAC 
GCTTT7CACG CTGGACTGCA CATTCTTTCC TAATAATAGT ACTGTAATTG 



7251 CTATAAAAAT AGGCGTATCA CGAGGCCCTT TCCTC 
GATATTTTTA TCCGCATAGT GCTCCGGGAA AGCAG 
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pVE 10I2-SO?(Z1 

General D scriptlon 

DNX pVR 1012-SCPtt) 
local object 

Created! 09/14/98 04.29FM 

Laat Modified* 09/15/98 04t50PM 

length: 7272 bp 

storage type: Basic 

form Circular 



Restriction Map 

Drain- 1 site CAgnwpTG 

Wndllfclsite 

Hpainsfte 

Kpnl:1$ite 

Not!: 1 site |!§gj§gj§ 

Pmtl:1$ite 

Pvufnsite 

Sacll: 1 site cccgrc 

Xbafclsite S» 

XhoMsrte 

EcoRVrasites «»» 
Ncot:2sHes =gg 
Noel: 2 sites 
Spru:2$ites ggjgj 

Functional Map 

CDS (4 signals) 

CMVIES'UT 

start i 888 sadt 112* 
CMV IE INT 

Start: 1130 Badt 1640 
TbGH 

Start: 4289 Eadt 4841 

Kanr 

Start t 6337 Endi «959 (Cooplameatcry) 

Misejeature (2 signals) 
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CMV enhancer 

Starts 248 End: 885 
SGP(Z) 

Start* 1870 End* 4288 

Annotations 
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1 TCCCGCOTTT CGSTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCO 
ACCGCGCAAA GCCACTACTG CCACTTTTGG AGACTCTGTA CCTCOAGCCC 



SI GACACGG7CA CACCTTGTCT GTAAGCGGAT CCCGGGAGCA GACAAOCCCG 
CTC7GCCA6T GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTG7TCGCCC 



101 TCAGCGCGCG TCAGCGGGTG TTGGCGGG7G TCGGGGCTGG CTTAACTATG 
ACTCCCGCCC AGTCGCCCAC AACCGCCCAC AGCCCCGACC GAATTGATAC 



Ndcl 

151 CGGCATCACA GCAGATTGTA CTGAGAGTCC ACCATATGCG GTGTGAAATA 
CCCGTACTCT CGTCTAACA? GACTCTCACG TCCTATACGC CACACTTTAT 



201 CCOCACAGAT GCCTAAGGAG AAAATACCGC ATCAQATTGG CTATTGCCCA 
GGCCTGTCTA CGCATTCCTC TTTTATGGCG TAGTCTAACC GATAACCGGT 



251 TTGCATACGT TGTATCCATA TCATAAIATG TACATTTATA TTGGCTCATO 
AACGTATGCA ACATAGGTA? ACTATTA7AC ATGTAAATA? MCCGAGTAC 



301 TCCAACATTA CCGCCXTGTT GACATTGATT ATTGACTAGT TAT7AATAGT 
AGCTTGTAAT GGCCCTACAA CTGTAACTAA TAACTGATCA ATAATTATCA 



351 AATCAATTAC GCGGTCATTA GTTCXTAGCC CATATATGGX G7TCCGCCTT 
77AG77AATG CCOCACTAAT CAAGTATCGG GTATATACCT CAAGGCGCAA 



401 ACA7AACTTA CCGTAAA?CG CCCGCCTGGC TCACCGCCCA ACGACCCCCG 
TGTATTGAAT GCCATTTACC GCGCGGACCG ACTGGCGCGT 7GCTGGGGGC 



451 CCCATTGACG TCAATA&TGA CGTATGTTCC CATAGTAACG CCAATAGGGA 
GGGTAACTCC AGTTATTACT CCATACAAGG GTATCATTGC GCTTATCCC- 



501 CTTTCCATTC ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTC 
GAAAGGTAAC TCCAGTTACC CACCTCATAA ATGCCATTTC ACGGGTCAAC 



5S1 GCACTACA1C AAGTCTATCA TATGCCAACT ACGCCCCCTA TTGACGTCAA 
CGTCATCTAG TTCACATACT ATACCCTTCA TCCGGGGCAT AACTGCAGTT 



$01 TCACGGTAAA TGGCCCGCCT GGCATTATGC CCAG7ACATC ACCT7ATGGG 
ACTGCCATTT ACCGGGCGGA CCCTAATACG GGTCATGTAC TGGAATACCC 



Nool 

651 ACTT7CCTAC TTGGCACTAC ATCTACGTAT TAGTCATCGC TATTACCATC 
TCAAAGGATG AACCGTCATG TAGATGCATA ATCAGTAGCC ATAATGCTAC 



NCOZ 

701 OTGATGCGGT TTTGGCAGTA -CATCAATGGG CGTCGATAGC GGTTTGAC7C 
CACTACGCCA AAACCGTCAT "CTAGTIACCC GCACCTATCG CCAAACTGAG 



751 ACCGGGATTT CCAAG1CTOC ACCCCATTGA CGTCAATGGG AGTITCTTI'T 
TGCCCCTAAA GGTTCAGAGC TGGGGTAACT GCAGTTACCC TCAAACAAAA 



801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 
CCCTGGTTT? AGTTGCCCTG AAAGGTTTTA CAGCATTCi u 7 GAGCCGGGGT 
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0S1 TTCACGCAAA TGGGCGCTAG GCCTGTACGG TGGGAGClCT ATATAAGCAG 
AACTGCCTTT ACCCGCCATC CGCACATGCC ACCCTCCAGA TATATOCCTC 



901 AGCTCGTTTA GTGAACCGTC AQATCOCCTO GAGACGCCAT CCACGCTGTT 
TCGAGCAAAT CACTTGGCAO TCTAGCGGAC CTCTGOGGTA GGTGCCACAA 



sacii 



951 TTGACCTCCA TAGAAGACAC CGGGACCGAT -CCAGCCTCCC CGGCCGGGAA 
AACTGGAGGT ATC TT C TGTG GCCCTGGCTA CGTCGGAGGC GCGCCCCCTT 



1001 CGGTGCATTG GAACGCGGAT TCCCCGTGCC AAGAGTGACG TAAGTACCGC 
GCCACGTAAC CTTGCGCCTA AGGGGCACGG TTCTCACTGC ATTCATGGCC 



SpbZ 



1051 CCA7AGACTC TATACGCACA CCCC7TTGGC TCTTATCCAT GCTATACTGT 
GATATCTGAG ATATCCGTGT GGGGAAACCG AGAATACGTA CGATATGACA 



1101 TTTTCGCTTG GGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGTGATGG 
AAAACCGAAC -CCCGGATATG TGGGGGCGAA GGAA.ACGAT ATCCACTACC 



1151 TATAGCTTAC CCTATACGTG 7GGGTTATTC ACCATTATTG ACCACTCCCC 
ATATCGAATC GGATA7CCAC ACCCAATAAC 7GGTAAXAAC TGGTOAGGGG 



1201 TATTGGTCAC GATACTTTCC ATTACTAATC CATAACATGG CTCTTTGCCA 
A7AACCACTG CTATGAAAGG TAATGATTAG CTATTG7 ACC GAGAAACGG7 



1251 CAACTATC7C TATTGGCTAT ATGCCAATAC TCTGTCCTrC AGAGACTGAC 
GTTGATAGAG ATAACCCATA TACGGTTATG AGACAGGAAG TCTCTGACTG 



1301 ACGGACTCTG TATTTTTACA GGATGGCGTC CCATTTATTA TTTACAAATT 
TGCCTGAGAC ATAAAAATGT CCTACCCCAG GGTAAATAAT AAATCTTTAA 



1351 CACATATACA ACAAGGCCGT -CCCCCGTGCC CGCAGTTTTT ATTAAACATA 
CTGTATATGT T G TTQCGGCA GGGGGCACGG GCGTCAAAAA TAATTTGTAT 



1401 GCGTGGGATC TCCACGCGAA TCTCGGCTAC GTGTTOCGGA -CATGGGCTCT 
CGCACCCTAG AGOTGCGCTT AGAQOCCATG CACAAGGGCT GTACCCGAGA 



1451 TCTCCGGTAG CCGCGGAGCT TCCACATCGG AOCCCTGGTC CCATGCCTCC 
AGAGCCCATC GCCGCC7COA AGGTGTAGGC TCGGGACCAG GGTACGGAGG 



1501 AGCGCCTCAT CGTCGCTCOG •CAOCT GCTTC CTOCTAACAG TGGAGGCCAG 
7CGCCGAG7A -CCAGCGAGCC GVCGAGGAAC GAGGATTGTC ACCTCCGCIC 



1551 ACTTAGGCAC ACCACAATGC CCAOCACCAC •CAGTGTGCCC CACAACCCCG 
TGAATCCG7G TCGTGTTACG GGTCGTGGTG G2CACACGGC G7GTTCCGGC 



1601 TGGCGGTAGG GTATGTGTCT GAAAATGAGC GTGGAGATTG GGCTCGCACG 
ACCGCCATCC CATACACAGA -CTTJTACTCG CAGCTCTAAC CCGAGOGTGC 



1651 GCTGACGCAC ATGGAAGACT TAAGGCAGCG -GCAGAACAAG ATGCAGGCAC 
CGACTGCCTC TACCTOCTOA ATTCOGtPCGC CGTCTTCTTC 7ACGTCCGTC 



1701 CTGAGTTCTT GTATTCTGAT AAGAGTCAGA "GGTAACTCCC CTTGCGGTCC 
GACCCAACAA «CA5>AAGWC*VA TTCTCAffTCT CGATTGAGGG <*ACGCCACG 
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HpaX 



1751 7GTTAACGGT GGACGGCAGT GTAGTCTGAG CACTACTCGT TGCTGCCCCG 
ACAATTGCCA CCTCCCGTCA CA7CAGACTC CTCATGACCA ACGACCGCGC 



Hcol 

1B01 CGCCCCACCA GACATAATAG CTGACAGAC? AACACACTG7 TCCTTTCCAT 
CCGCGGTGGT CCCTATTATC GACTGTCTGA TTCTCTGACA AGGAAAGCTA 



Ncol Pali EcoRVHotI 



1851 GGCTCTTnC TGCAGTCACC CTCGTCGACA CGTCTGATCA CATATCGCGC 
CCCAGAXAAO ACCTCAG7GG CAGCAGCTGT GCACACTAGT CTATAGCGCC 



NotI XbftZ 



1301 CCGC7CTAGA CCAGGCGCCT GGATCGAATT GATGAAGATT AAGCCGACAG 
GGC3AGATCT GGTCCGCGGA CCTAGC77AA CTACTTCTAA TTCGGCTGTC 



1951 TGAGCGTAAT CTTCATCTCT CTTAGATTAT TTGTTTTCCA GACTAGGGGT 
ACTCGCATTA GAAGTAGAGA GAATCTAATA AACAAAAGGT CTCATCCCCA 



2001 CCTCAGGTCC TTTCCAATCG TGTAACCAAA ATAAACTCCA CTAGAAGGAT 
CCAGTCCAGG AAAAGTTAGC ACA77CGTTT TATTTGAGGT GATCTTCCTA 



2051 AVTGTGGGGC AACAACACAA TGGGCGTTAC AGGAATATTG CAGTTACCTC 
TAACACCCCG TTGTTG7GTT ACCCGCAATG TCCTTATAAC GTCAATGGAG 



2101 GTGATCGATT CAAGAGGACA TCATTCTTTC TTTGGGTAAT TATCCTTTTC 
CACCAGCTAA GTTCTCCTCT AGTAACAAAG AAACCCATTA ATAGCAAAAG 



2151 CAAAGAACAT TT7CCATCCC AC77GGAGTC ATCCACAATA GCACATTACA 
GT7TCTTGTA AAAGGTAGGG TGAACC7CAG TAGGTGTTAT CGTGTAATGT 



2201 GGiVAGTGAT CTCGACAAAC TAGTTTGTCG TGACAAACTG TCATCCACAA 
CCAATCACTA CAGCTGTTTG ATCAAACAGC ACTGTTTGAC AGTAGGTGTT 



2251 A7CAATTGAG ATCAGTTGGA CTGAATCrCG AAGGGAATGG AGTGGCAACT 
TAGTTAACTC TAGTCAACC? GACTTAGAGC TTCCCTTACC TCACCCTTGA 



2301 GACCTCCCAT CTGCAACTAA AAGATGGGGC TTCAGGTCCG GTGTCCCACC 
CTGCACGGTA GACGTTGATT TTCTACCCCG AAGTCCAGGC CACAGGGTGG 



2251 AAAGGTGG7C AATTATGAAG CTGGTGAATG GGCTGAAAAC TCCTACAATC 

TTTCCACCAG ttaatacttc caccacttac CCGACTTTTC ACGATGTTAG 



2401 TTGAAA7CAA AAAAC CTGAC GGGAGTGAGT GTCTAC CAQC AGCGCCAGAC 
AACTTTAGTT TTTTGGACTG CCCTCACTCA CAQATGGTCG TCGCGGTCTG 



2451 GCGATTCGGG GCTTOCCCCC CTGCCGCTAT GTGCACAAAG TATCAGGAAC 
CCCTAAGCCC CGAAGGGGGC CACGGCCATA CACGTGTTTC ATACTCCTTG 



2501 GGCACCGTCT GCCGGAGACT TTGCCTTCCA TAAAGAGGGT GCTTfCTTCC 
GCCTGGCACA CGGCCTCTGA AACGGAAGGT ATTTCTCCCA CGAAAGAAGG 



2551 TGTATCATCC ACTTGCTTCC ACAGTTATCT ACCCAGGAAC CACTTTCGCT 
ACATACTAGC TGAACGAAGG TGTCAATAGA TGGCTCCT7G CTGAAAGCGA 
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2601 CAAGGTCTCG TTGCATTTCT GATACTCCCC CAAGCTAAGA AGGACTTCTT 
CTTCCACAGC AACGTAAAGA CTATGACCCG GTTCGATTCT 1CCTGAAGAA 



2651 CAOCTCACAC CCCTTGAGAG AGCCGGTCAA TOCAACGGAC GACCCCTCTA 
GTCCAGTGTG GGGAACTCTC TCGGCCAGTT ACGTTGCCTC CTGCGCAOAT 



€coRV 



2701 GTGGCTACTA TTCTACCACA ATTAGATATC AGGCTACCGG TTTTGGAACC 
CACCGATGAT AAGATGGTGT 7AA7CTATAG TCCGATGGCC AAAACCTTGO 



27S1 AATGAGAGAG ACTACTTCTT CCAGGTTGAC AATTTCACCT ACGTJCCAXCT 
TTACTCTGTC TCATCAACAA GCTCCAACTG TTAAACTGGA TGCACCTTGA 



2801 TGAATCAAGA TTCACACCAC AGTTTCTOCT CCAGCTOAAT -GAGACAATAT 
ACT7AGTTCT AAGTGTCCTG TCAAAGftCGA GGTCGXCTTA CTCTGTTATA 



2«51 ATACAACTGG GAAAAGGACC AA7ACCACGG GAAAACTAA? TTGGAAGGTC 
TATGTrCACC CTTTTCCTCG TTATGGTGCC CTTTTGATTA AACCTTOCAG 



2901 AACCCCCAAA TTGATACAAC AATCGGGGAG TGGGCCWCT GGCAAXCTAA 
TTGGGGCTTT AACTATGTTG TTAGCCCCTC ACOCGGAAGA CCCTTTGATT 



2951 AAAAACCTCA CTAGAAAAAT TCGCAGTGAA GAGTTCTC7T TCACAGTTCT 
TTTTTGGAGT GATCTTTTTA AGCG7CACTT CTCAACAGAA AGTGTCAACA 



3001 ATCAAACGGA GCCAAAAACA TCAGTGCTCA GACTCCGGCG CGAACTTCTT 
TAGTTTCCCT CGCTTT7TGT AGTCACCAGT CTCAGGCCGC -GCTTCAAGAA 



3051 CCGACCCAGG GACCAACACA ACAACTGAAG ACCACAAAAT CATCGCTTCA 
GGCTGGCTCC CTGGTTGTGT TGTTGACTTC TGGTCOTTTA GTACCGAACT 



3101 GAAAATXCT CTGCAATGGT TCAAGTCCAC ACTCAAGGAA GGGAAGCTGC 
CTTTTAAGGA GACUTTACCA AG77GACGTG TCAGTTCCTT OCCTTCGACC 



3151 AGTGTCGCAT CTAACAACCC TTCCCACAAT CTCCACGAGT GCCCAATCCC 
TCACAGCGTA GAT7GTTGGG AACGGTCTTA GAGCTGCTCA GGGGTTAGGG 



3201 TCACAACCAA ACCAGGTCCG CACAACAGCA CCCA7AATAC ACC0C7GTAT 
AGTGTTGGTT TGGTGCAGGC CTGTTGTCGT GCGTATTATG TGGGCACATA 



3251 AAACTTGACA TCTCTGXOGC AACTCAACTT CAACAACATC ACCGCAGAAC 
TTTGAACTGT AGAGACTCCG TTGACTTCAA C7TG71CTAG TCGGGTCTTG 



3301 AGACAACGAC AGCACAGCCT CCGACACTCC CTCTOCC*CG ACC CCAG CCC 
TC ro WGCTO TCGTGTCCGA GCCTGTGAGG -GAGACCGTGC TGGCGTCGGC 



3351 GACCCCCAAA AGCAGAGAAC ACCAACACGA CCAAGAGCAC TGACTKCTG 
CTGGGGGTTT TCGTCTCTTC TGGTTGTCCT CGTTCTCGTG ACTGAACGAC 



3401 GACCCCGCCA CCACAACAAG TOCCCAAAAC CACAGCGA&A CCQ CTGQCAA 
CTGGGGCGGT CCTG T T CT T C AGGCGTTTTC <5TGTOCCTCT GGCGAGCGTT 



3451 CAACAACACT CATCACCAAG A7ACCGGAGA AGACASTCCC ACCAGCGGGA 
GTTCTTGTGA GTAGTGCTTC TATGGCCTCT TCTCP CA CCC TCGTCCOCCT 
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3501 ACCTAGGCTT AATTACCAA? ACTATTGCTG CAGTCGCACG ACTGATCACA 
TCGATCCCAA T7AATGCTTA TCATAACGAC CTCAGCGTCC TGACTAGTCT 
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3551 GGCCGGAGAA GAACTCGAAG AGAAGCAATT CTCAATGCTC AACCCAAATC 
CCGCCCTCTT CTTGAGCTTC TCTTCGTTAA CAGTTACGAG TTGGGTXTAC 



3601 CAACCCTAAT TTACATTACT GGACTACTCA CCATGAACGT GCTCCAATCC 
CTT06GATTA AATGTAATGA CCTCATGACT CCTACTTCCA CCACGTTACC 



3651 GACTGCCCTG CATACCATAT 7TCGGGCCAG CAGCCGAGGG AATTTACATA 
CTGACCGGAC CTATGGTATA AAGCCCCGTC CTCGGCTCCC TTAAATGTAT 



3701 GAGCGGCTAA TGCACAATCA AGATGGTTTA A7C7GTGGGT TGAGACAGCT 
CTCCCCGATT ACCTGTTACT TCTACCAAAT TAGACACCCA ACTCTGTCGA 



3751 GCCCAACGAG ACCAC7CAAG CTCTTCAACT CTTCCTGAGA GCCACAACTG 

ccggttcck: tcctgagttc GAGAAGTTGA caaggxctc? CGGTGTTGAC 



3801 AGCTACGCAC CTTTTCAATC CTCAACCGTA AGGCAATTGA TTTCT7GCTG 
TCGATGCGTG GAAAAGTTAG GAGTTGGCAT TCCCTTAACT AAAGAACCAC 



3851 CAGCCATGGG GCGGCACATG CCACATTCTG GGACCGGACT GCTGTA1CGA 
GTCCCTACCC CGCCCTGTAC CGTG7AXCAC CCTCGCCTGA CGACATAGCT 



3901 ACCACATGAT TGGACCAAGA ACATAACAGA CAAAATTGAT CAGATTATTC 
TG6TCTACTA ACC7GG7TCT TCTATTCTCT GTTTTAACTA CTCTAATAAG 



3951 ATGATTTTGT 7GATAAAACC CTTCCGGACC AGGGGGACAA TGACAATTGC 
TACTAAAACA ACTATTTTGG GAAGGCCTCG TCCCCCTGTT ACTGTTAACC 



4001 TGGACAGGA7 CCAGACAATG GATACCGGCA GGTATTGGAG T7ACAGGCCT 
ACCTGTCCTA CCTC7G77AC CTATGCCCGT CCATAACCTC AA7CTCCGCA 



<051 TATAATTCCA GTTATCGC77 TA7TCTGTA7 ATGCAAATTT GTCTTTTAGT 
ATATTAACGT CAATAGCGAA ATAAGACATA TACGT77AAA CAGAAAATCA 



4X01 TTTTCTTCAG ATTGCTTCAT GGAAAAGCTC AGCCTCAAAT CAA7GAAACC 
AAAAGAAGTC TAACGAAGTA CCTTTTCCAG 7CGCAG7T7A GT7AC777GG 



4151 AGGATTTAAT 7ATATGGAT? AC7TGAA7C7 AAGATTACTT GACAAATGAT 
TCCTAAATTA ATATACCTAA TGAACTTAGA TTCTAATCAA CTG777AC7A 



4201 AATATAATAC AC7GGAGC77 TAAACATAGC CAATGTGATT CTAACTCCTT 
TTATATTATG 7GACCTCGAA A777G7ATCG GTTAGACTAA GATTCAGGAA 



4251 7AAACTCACA GTTAATCATA AACAAGGTTT GGAATTGATC 7GCTGTGCCT 
AS77CAGTG7 CAA77AC7A7 7TGT7CCAAA CCTTAACTAG ACGACACGGA 



4301 TCXAGTTGCC AGCCATC7GT TCTTTGCCCC TCOCCCGTGC CTTCCTTGAC 
AGATCAACGG TCGGTAGACA ACAAACGGGG AGGGCGCACG GAAGGAACTG ' 



4351 CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT CAGGAAATTG 
CGACCTTCCA CCCTGAGOCT GACACGAAAG CATTATTTTA CTCCTTTAAC 



4401 CA7CGCATTG TCTGAGTACG TGTCATICTA 77C7GGCGGG TGGGGT6GGC 
CTAGCGTAAC AGACTCA3CC ACAGTAAGAT AAGACCCCCC ACCCCACCCC 
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* » J A 


CACCACAGCA AGGGCGAGGA TTGGGAAGAC AATACCAGGC ATGCTGGGGA 
GTCGTCTCGT TCCCCCTCCT AACCCTTCTG TTATCGTCCG 7ACOACCCCT 










4501 


TCCGGTCGGC TCTATGGGTA CCCAGCTGCT -GAAGAATTCA OCCGGTTCCT 
ACGCCACCCG AGATACCCAT GGGTCCACGA CTTCTTAACT GGGCCAAGGA 




45S1 


CCTGGGCCAG AAAGAAGCAG GCACATCCCC TTCTCTGTGA CACACOCTOT 
GGACCCCOTC TTTCTTCCTC CCTCTAGGGC AAGAGACACT -GTCTCGGACA 




4*01 


CCACCCCCCT GGTTCTTAGT TCCAGCOCCA CTCATACCAC ACTCATAOCT 
CCTGCGCCGA CCAAGAATCA AGGTCCCGGT GAGTATCCTG TGAG7ATCGA 




4651 


CAGGAGGGCT CCCCCTTCAA TCCCACCCGC TAAAGTACTT GGACCCGTCT 
GTCCTCCCGA GGCGGAAGTT AGGOTGGGCG ATTTCATGAA CCTCGCCAGA 




4701 


CTCCCTCCCT CATCAGCCCA CCAAACCAAA CCTAGCCTCC AAGAG7GGGA 
GAGGCAGGGA GTAGTCCGGT GGTT3GGTTT GGATCGGAGG TTCTCACCCT 




4751 


AGAAATTAAA CCAAGATAGG CTATTAAGTC CAGAGGGAGA GAAAATGCCT 
TCTTTAATTT CGTTCTATCC GATAATTCAC <5TCTCCCTCT CTTTTACGGA 




4B01 


CCAACATGTC AGGAACTAAT CAGAGAAATC ATACAATTTC TTCCGCTTCC 
GGTTGTACAC TCCTTCATTA CTCTCTTTAG TATCTTAAAG AACGCGAAGG 




4851 


TCCCTCACTG ACTCGCTGCG CTCGCTCGTT -CGGCTGOGGC GAGCGGTATC 
AGCGAG7GAC TGAGCGACGC GAGCCAGCAA GCCGACGCCG CTCGCCATAG 




4901 


AGCTCACTCA AAGCCGG7AA TACGGTTATC CACAGAATCA GGGGATAACC 
TCGAG7GAGT TTCCGCCATT ATGCCAATAG GTGTCTTAGT CCCCTATTGC 




4951 


CAGGAAAGAA CATGTGAGCA AAACGCCAGC AAAAGGCCAG CAACCGTAAA 
G7CCTTTCTT GTACACTCGT TTTCCGCICG TTTTCCGGTC CTTGGCATTT 




5001 


AAGGCCGCG? TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTCACGAGCA 
TTCCGCCGCA ACGACCCCAA AAAGG7ATCC -GAGGCGGGGG GACTGCTCGT 




5051 


TCACAAAAAT CGACGCTCAA GTCAGAGGTC -GCGAAACCCC ACAGGACTAT 
AGTGTTTT7A GCTGCGAGTT CAGTCTCCAC CCCTrtGCGC TC1CCTGATA 




5101 


AAACATACCA {JGCGTTTCCC -CCTGGAAGCT GCCTCGTGCG CTC7CCTCTT 
TTTCTATGGT CCGCAAAfiGG GCACCTTCGA GGGACCACGC CAGAGGACAA 




5151 


CCCACCCTGC -CGCTOACCGC ATACCTCTCC «CTTTCTCC CTTCCGGAAG 
GGCTGGOACG GCGAATGCCC TATGGACAGC -CGGAAAGAGG GAAGGCCTTC 




5201 


CGTGGCGCTT TCTCAATGCT CACGCTCTAG GTATCTCAGT TCGOTGTAGG 
CCACCGCGAA ACAGTTACGA GTGCGACATC CATAGACTCA AGOCACATCC 




5251 


TCGTTCGCTC CAAGCTCGGC TCTGTCCACG AftGCCCCCGT TCAGOCCCAC 
AGCAAGCGAG <3TTCGACCCG ACAGACGTCC TTGCGGGGCA AGTCGGGCTG 




S301 


CCCTGCGCCT TATCCGGTAA CTATCGTCTT CAGTCCAACC CGGTAAGACA 
CCGACGCGGA ATAGGCCATT -GATACCAGAA CTCAGGTTCG -GCCATTCTCT 
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5351 CCACTTATCG CCACTGGCAG CAGCCACTOG TAACAGGATT AGCAGAGCCA 
GC7GAATAGC GGTGACCCXC GTCCGTOACC ATTGTCC2AA TCGTCTCGCT 



5401 GCTATGTAGG CGCTOCTACA GAGTTCTTGA ACTCOTGGCC TAACTACCGC 
CCATACATCC GCCACGATGT CTCAAGAAC? TCACCACCGG ATTCATGCCG 



5451 TACACTAGAA GCACAGTATT TGGTATCTOC CCTCTGCTCA AGCCACTTAC 
ATGTCATCTT CCTGTCATAA ACCATAGACC CGAGACCACT TCGCTCAATC 



5501 CTTCGGAAAA AGAGTTGGTA GCTCTTCATC CGGCAAACAA ACCACCGCTG 
GAAGCCTTTT TCTCAACCAT CGAGAACTAG CCCGTTTCTT TGGTCCCGAC 



5551 CTAGCGCTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA 
CATCGCCACC AAAAAAACAA ACCTTCGTCC 7CTAATGCGC GTCTTTTTTT 



5601 GGATCTCAAC AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG 
CCTAGAGT7C TTCTAGGAAA CTACAAAAGA TGCCCCAGAC TGCGAGTCAC 



5651 GAACGAAAAC 7CACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA 
CTTGCTTTTG AGTGCAATTC CCTAAAACCA GTAC1CTAAT AGTTTTTCCT 



5701 TCTTCACCTA GATCCSTTTA AATTAAAAAT GAACTTTTAA ATCAATCTAA 
AGAAGTGGA? CTAGCAAAAT TTAATTTTTA CTTCAAAATT TAGTTAGATT 



5751 AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAG7GA 
TCATATATAC TCATTTGAAC CAGACTCTCA ATCG7TACGA ATTAGTCAC7 



5801 CCCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 
CCGTCGATAG AGTCGCTAGA CAGATAAAGC AAGTAGCTAT CAACGGACTG 



5851 TCCCGGGGGG GGGGCCGCTG AGGTCTGCCT CGTGAAGAAG GTCTTGCTCA 
AGGCCCCCCC: CCCCCGCGAC TCCAGACGGA GCACTTCTTC CACAACGACT 



5901 C7CATACCAG CCCTGAATCG CCCCAICATC CACCCAGAAA GTCAGGGAGC 
GAGTATGGTC CGGACTTAGC GGGGTAGTAG GTCGGTCTTT CACTCCCTCG 



5951 CACCG7TGAT GAGAGCTTTG TTGTAGGTGC ACCAGTTGG7 GA7TTTGAAC 
G7GCCAACTA CTCTCGAAAC AACATCCACC TGGTCAACCA CTAAAACTTG 



6001 TTT3GCC7TG CCACGGAACG GTCTGCGTTG TCCGGAAGA7 GCGTGATCTG 
AAAACGAAAC GGTGCCTTGC CAGACGCAAC AGCCCTTCTA CGCAC7AGAC 



€051 ATCCrTCAAC TCACCAAAAC TTCCATTTAT TCAACAAAGC CGCCCTCCCG 
TAGGAAGTTG AOTCGTTTTC AAGCTAAATA A GTTOTT T CC GCGGCAGGGC 



6101 TCAAGTCAGC GTAATGCTCT CCCAGTCTTA CAACCAA7TA ACCAATTCTC 
ACTTCAGTCG CATTACGACA CGCTCACAAT GTTGGTTAAT TGGTTAAGAC 



6151 ATTACAAAAA CTCATCGAGC ATCAAATGAA ACTGCAATTT ATTCATATCA 
TAATCTTTTT GAGTAGCTCG TAGTTTACTT TGACCTTAAA TAAGTATAGT 



6201 GGATTATCAA TACCATATTT TTGAAAAAGC -CGTTTCTGTA ATGAACCAOA 
CCTAATAGTT ATGGTATAAA AACTTTTTCG GCAAAGACAT TACTSCCTCT 



6251 AAACICACCG AGCCAGTTCC ATAGGATGGC AAGATCCTGG TATCGGTCTO 
TTTGAG5GGC TCCGTCAAGG TATCCTACCG TTCTAGGACC AtAGOCAGAC 
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€301 CGAT7CCGAC TC6TCCAACA TCAATACAAC CTAT7AAT7T CCCCTCCTCA 
GCTAAGOCTG AGCAGGTTCT AGTTA7CTTO GATAATTAAA GGCCAGCAGT 



6351 AAAAT AAGGT TATCAAGTCA OAAATCACCA TOACTCACOA CTCAATCCGG 
TTTTATTCCA ATAGTTCACT CTTTAG7GGT ACTCAC7CCT GACTTAGCCC 



Hindizz 

6401 TGAGAATGGC AAAAGCTTAT GCATTTCTTT CCAGACTTGT TCAACAGGCC 
ACTCTTACCC TTTTCGAATA CGTAAAGAAA CGTCTGAACA AGTTGTCCGG 



€451 AGCCATTACG CTCGTCATCA AAATCACTCG CATCAACCAA ACCGTTATTC 
TCCGTAATGC GAGCACTAGT 7T7AG7GAGC GTA QTTGGT T TGGCAATAAO 



PVUl 

€501 ATTCGTGATT GCGCCTGAGC GAGACGAAAT ACGCGATCGC TGT7AAAAGG 
TAAGCACTAA CCCGGACTCC CTCTGCTTTA TGCGC7AGCG ACAATTTTCC 



€551 ACAATTACAA ACAGGAATCG AATGCAACCG GCGCAGGAAC ACTGCCAGCG 
TG77AATGTT TCTCCTTAGC TTACGTTGGC CGCCTCCTTG TGACGGTCGC 



€601 CATCAACAAT ATTT T C AGCT CAATCAGCAT ATTCT7CTAA TACCTGGAAT 
GTAGTTGTTA 7AAAAGTGCA C77AGTCCTA TAAGAAGA7T ATGGACC7TA 



€«i gctgttttcc cggggatcgc agtggtgagt aaccatgcat catcacgagt 

CGACAAAAGG GCCCCTAGCG TCACCACTCA T7GG7ACGTA GTAG7CCTCA 



€701 ACCGATAAAA TGC77CATGG 7CGGAAGAGG CATAAATTCC GTCAGCCAGT 
TCCCTATTTT ACCAACTACC AGC C TTCTCC GTATTTAAGG CAGTCGGTCA 



€751 TTAGTCTGAC CATCTCATCT CTAACATCAT TGGCAACGCT ACCTTTGCCA 
AATCAGACTG GTACAGTAGA CAT7GTAGTA ACCGTTGCGA TGGAAACGG7 



€801 TCTTTCAGAA ACAACVCTGG CGCA7CCGGC TTCCCATACA ATCGATAGAT 
ACAAAGTCTT TGTTGAGACC GCGTAGCCCG AAGGGTATGT 7AGCTA0CTA 



C€51 TGTCSCACCT GAT7GCCCGA CATTATCGCG AGCCCATTTA 7ACCCA7ATA 
ACAGCGTGGA CTAACGGGCT G7AATAGCGC TCCGGTAAAT ATGGGTATAT 



Xhoz 

€901 AATCAGCATC CATGTTGGAA TTTAATCGCG GCCTCGAGCA AGACGT7TCC 
TTAGTCGTAG G7ACAACCTT AAATTAGCGC CGGAGCTCGT TCTOCAAAGC 



€951 CGTTCAATAT CCCTCATAAC ACCCCTTGTA TTACTGWTA TGTAAGCAGA 
GCAACTTATA CCGAGTATTG TQGGGAACAT AATGACAAAT ACA7TCGTCT 



7001 CAGTTTTATT CTTCATCATG ATATATTTTT ATCTTGTGCA ATGTAACATC 
GTCAAAATAA CAAGTACTAC TATATAAAAA TAGAACACGT TACATTGTAC 



DraXXX 

7051 AG7GATTTTG agacacaacg tcgctttccc ccccccccca TTATTGAAGC 

TCTCTAAAAC TCTGTGTTGC ACCGAAAGGG GGGGGGGGGT AATAACTTGG 



7101 ATTTATCACG GTTATTGTCT CA7GAGCGGA TACATATTTG AATGTATTTA 
TAAA7AGTCC CAATAACAGA GTACTCCCCT ATGTATAAAC TTACATAAAT 
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7151 C *** AAT ** A CAAATAQGGG TOCGCGCAC ATTICCCCfll XAX^rir 
CTTTTTATTT GTTTATCCCC AAflOCGCCTC TAAAOGGGCT TTTCAOGGKJ 

7201 CTGACGTCTA AGAAACCAT? ATTATCATCA CATTAACCTA 7kXXXXTXGQ 
^ ^GACTGCAGAT TCTTTGGTAA TAATAGTACT GTAATTCGA? ATTTTTATCC 

7251 CCTATCACGA CGCCCTTTCG TC 
GCATAGTCCT CCGGGAAAGC AO 
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